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PYRIMIDINE DERIVATIVES AND 
OLIGONUCLEOTIDES CONTAINING SAME 

BACKGROUND OF THE INVENTIQN 

5 

This invention relates to the field of labels, particularly 
labels for diagnostic or analytical use. In particular, it relates 
to oligoaucleotides that are modified to enhance the binding 
affinity or the binding specificity of the oligonucleotides for 
complementary sequences and that in addition optionally 
bear a readily detectable characteristic. 

Sequence specific binding of oligonucleotides both to 
single stranded RNA and DNAand to duplex DNAis widely 
known. This phenomenon has been harnessed for a great 
variety of diagnostic, therapeutic and analytical, e.g., 15 
sequence determination or gene mapping, purposes. 
Previously, one objective of research in this field has been to 
increase the affinity of such oligonucleotides for their 
complementary sequences. For example, workers have 
de^scribed oligonucleotides containing 5-substituted pyrimi- 20 
dine bases that substantially increase the Tm for oligonucle- 
otide binding to complementary bases (International Publi- 
cation No. WO 93A0820). 

Publications have described the use of fluorescent 
cytosine derivatives to prepare labeled DNA probes. See 
Inoue et al., Jpn. Kokai JP 62059293, (1987). In addition, 
fluorescent labeled nucleotides have been employed in DNA 
sequencing. See Prober et al., "Science" 238:336-341 
(1987). 

l,3-Dihydro-2H-imidazo[4,5-b]-quinolin-2-one deriva- 
tives as phosphodiesterase inhibitors are disclosed by Raey- 
mackcrs ct al, (EP 541,153). 

U.S. Pat. No. 5,502,177, discloses phenoxazine 
polycycle-containing oligonucleotides and monomers for 35 
preparing the oHgonucleotides. 

OBJECTS OF THE INVENTION 

The invention compositions or methods accomplish one 40- 
or more of the following objects. 

An object of this invention is to increase the afiSnity of 
oligonucleotides for their complementary sequences. 

An object of this invention is to increase the specificity of 45 
oligonucleotides for their complementary sequences. 

Another object of this invention is to provide detectable 
labels for use in diagnostic assays. 

Another object is to enhance diagnostic assays that use 
oligonucleotides. 

Another object is to improve the therapeutic efficacy of 
oligonucleotides. 

Another object is to improve the potency of oligonucle- 
otides as aotisense reagents that affect gene expression by 
altering intracellular metabolism of complementary RNA 
sequences encoding a target gene(s). 

Another object is to provide chemical intermediates and 
synthesis methods to prepare the invention compositions. 

These and other objects of the invention will be apparent 
when one considers the disclosure as a whole. 

SUMMARY OF THE INVENTION 

In accordance with the objects, the invention provides 
compounds having the structure (1) 



(1) 




and tautomers, solvates and salts thereof, wherein 

R^ is a binding partner, a protecting group, a linker or 

— H; 

R^ is A(Z)xi' wherein A is a spacer and Z independently 
is a label bonding group optionally bonded to a detect- 
able label, but R^ is not amine, protected amine, nitro 
or cyano; 

R^'' is independently — CH=, — N=, — C(Ci.8 alkyl>= 
or — C(halogen)=, but no adjacent R^^ are both 
— ^N=, or two adjacent R^^ are taken together to form 
a ring having the structure, 




50 



where R" is independently — CH=, — N=, —0(0^.8 
alkyl)= or — C(halogen)=:, but no adjacent R" are both 
-N=; 

is — O— , — S— or — N(CH3)— ; and 
XI is 1,2 or 3. ' 

When the binding partner R^ is an oligonucleotide, 
embodiments of the compounds of this invention include 
oligonucleotides of structure (2), (2A), (2B), or (2C) 



(2) 
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-continued 

(2A) 

N 

(R*%S P=0 




wherein 

D Is — OH, protected — OH, an oligonucleotide coupling 
group or a solid support; 

is an oligonucleotide coupling group, — OH, protected 
— OH or a solid support, wherein is bonded to one 
2* or 3' position in the oligonucleotide of structure (2) 
and the adjacent 2' or 3' position in structure (2) is 
substituted with provided that D and are not 
both an oligonucleotide coupling group or they are not 
both a solid support; 

D2 is — COjR^ — C(0)N(R^)2, ^03R^ — S02N(R02 
or an activated derivative of — COjH or — SO3H; 
is a protecting group, — ^H or — (CH^2-6 — ^N(R^)2; 

R"* is independently a phosphodiester linkage or a phos- 
phodiester substihite linkage, wherein R'* is bonded to 
one 2' or 3' position in the structure (2) oligonucleotide 




4 

and the adjacent X or 3' position in structure (2) is 
substituted with R^^ 

R^ is independently — H or a protecting group; 

R^^ is independently — H, — OH, halogen or a moiety that 
^ enhances the oligonucleotide against nuclease cleav- 
age; 

R^'' is independendy — O— , — CH^— or — CF.— ; 
n is an integer from 0 to 98; and 
10 B independently is a purine or pyrimidine base or a 

protected derivative thereof, provided that at least one 

B is a base of structure (3) 

(3) 
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Embodiments include compositions useful as intermedi- 
ates in making the structure (1) compounds, including 
intermediates having structure (4) 

(4) 




45 and tautomers, solvates and salts thereof wherein, 
R^^ is a halogen; and 
R^^ is — SH, —OH, =S or =0. 
In a further embodiment, the invention includes contact- 
ing a structure (2), (2A), (2B), or (2C) oligonucleotide, 

50 wherein n is at least about 7, with a sample suspected to 
contain a nucleic acid having a base sequence that is at least 
substantially complementary to the structure (2), (2A), (2B), 
or (2C) oligonucleotide. 
In a further embodiment, the invention includes detecting 

55 the presence, absence or amount of a complex comprising a 
structure (2), (2A), (2B), or (2C) oligonucleotide, wherein n 
is at least about 7, and a nucleic acid having a base sequence 
that is at least substantially complementary to the strucmre 
(2), (2A), (2B), or (2C) oligonucleotide, 

60 In a further embodiment, the invention includes convert- 
ing a structure (4) compound to a compound of structure (1) 
where R^ is — 0 — or — S — and the R^ atom or moiety 
alpha to the ring containing R^^ is — 0 — , — S — or 
— CH2 — , by displacing R^"*. 

65 In a further embodiinent, the invention includes convert- 
ing a structure (4A) compound (a structure (1) compound 
where R^ is replaced with — NH2); to a compound of 
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Structure (1), by reductive alkylation of the — ^NHj group to pentofuranose sugars such as 2'-deoxyribose, ribose, 

yield a structure (1) compound where the R- moiety alpha to arabinose, xylose, their 2*-deoxy and 3'-deoxy derivatives, 

the ring containing R^' is — NH — , their 2'3'-didcoxy derivatives, and their derivatives contain- 
ing R"^ linked to the 2 or 3' position, usually the 2' position. 
DETAILED DESCRIPTION OF THE 5 Monosaccharide also includes the 2'3' dideoxydidehydro 

INVENTION derivative of ribose. Monosaccharides include the D- and 

The invention comprises all combinations formed by L-isomers of glucose, fiructose, mannose, idose, galactose, 

independently selecting individual Markush group members allosc, gulose, altrose, talose, fucose, erythrose, threose, 

and assembling them in accordance with the teachings lyxosc, erythrulose, riTjulose, xylulose, ribose, arabinose, 
herein. The invention optionally excludes any feature or 10 xylose, psicose, sorbose, tagatose, glyceraldehyde, dihy- 

substance found in, or obvious over, the prior art. droxyacetone and their monodeoxy derivatives such as 

Structural formulas are designated as parenthetical rhamnose. Monosaccharides are optionally protected or par- 

numerals. We intend that designation of aromaticity with ^^^^Y protected. 

respect to carbocycles and heterocycles herein includes any As used herein, a "protecting group" means a moiety that 
highly resonant unsaturated ring structure. Alternatively, prevents the atom to which it is linked from participating in 
placement of double bonds, where indicated, represents one unwanted reactions. For example, for — OR^, R^ is a pro- 
potential structure for the depicted compound but we intend tecting group for the oxygen atom found in a hydroxyl or 
this depiction to include other resonant states of the com- carboxyl group, for — SR^, R^ is a protecting group for 
pound as well as protonated and charged species, only one sulfiir in thiols for instance, and for — NHR^ or — N(R^)— , 
of which a structure may show. 20 is a nitrogen atom protecting group for primary or 
The invention includes invention compounds in secondary amines. Hydroxyl, amine and other reactive 
unpurified, substantially purified and purified forms. It groups are found in invention compounds at, e.g., R , R or 
includes invention compounds that are present with any ohgonucleotide linkages or oligonucleotide bases. These 
additional component(s) such as a solvent, reactant or groups may require protection against reactions taking place 
by-product that is present during invention compound syn- elsewhere in the molecule. The protectmg groups for 
thesis or purification, and any additional component(s) that oxygen, sulfur or nitrogen atoms are usually used to prevent 
is present during the use or manufacture of an invention unwanted reactions with electrophiUc compounds, such as 
compound. acylating or phosphorylating agents used, e.g., m 

„ I , . , o r>i r», T nucleoside, nucleotide or oligonucleotide chemistry. 

Halo and halogen mean F, CI, Br or I. m „ . . , , * , . , 

, , uju ^^ A Protecting groups are mtended to be removed by known 

Aikyl means unbranched, branched or cychc hydrocar- , * -f, ^ ^ , *u„* ««^*o^*«^ 

, /. , * J * J u - procedures, although It will be understood that the protected 
bons that are saturated or unsanirated, or combmations f ^ * r n -^u- *u e ♦w • f-^ Tn,^ 
. f All 1 • 1 J 11 • * • mtermediates fall withm the scope of this mvention. The 
thereof. Alkyl mcludes all isomers, e.g., stereoisomers, 1 * u a ^ ^^^^-^u* 
, . J. ^ J • • _ All .1 removal of the protectmg group may be arduous or straight- 
positional isomers, diastereomers and regioisomers. Alkyl ^ , . • a u.^^ ^Au^ 
^ . , * J -ii * • 11 * • 1 o a forward, depending upon the economics and naUire of the 
moieties that are unsaturated will typically contain 1, 2, 3 or • • 1 j t 1 , n « 

^„ i-iTT i-i r> JiJ^.r^u conversions mvolved. In general, one will use a protecting 

more — CH=CH — or — C=C — groups, usually one such , . P . ^. „ r *u 

^ ^ group vnih exocychc ammes m the B groups of the com- 

pounds of this invention. For oligonucleotide containing 

Substimted alkyl means unbranched, branched or cychc ^^^^ g ^ competent, exocycUc 

hydrocarbons that are sanirated or unsaturated, or combma- deprotected because tiie amine groups 
tions thereof, where the hydrocarbon contams a heteroatom 40 participate in hydrogen bonding with complementary bases, 

hnked to a carbon or a heteroatom that replaces a carbon ^in^narly, one will typically use reversible protecting groups 

atom. Substituted alkyl includes all isomers, e.g., for the 5' and 3' hydroxyl groups of pentofuranose sugars in 

stereoisomers, positional isomer^ diastereomers and regioi- nucleotides intended for use as monomers in synthesis of 

somers Substituted alkyl moieties that arcunsaturatedwill oUgonucleotides containing 3,5' linkages. Protecting groups 
typicaUy contam 12, 3 or more -CH^OI— or —C^— 45 commonly are employed to protect against covalent modi- 

C-- groups, usually one such group, Substimted aDcyl ^^^^.^^ ^ ^^^^.^.^^ ^^^^^^^^^ ^^^^ 

mcludes alkyl groups having substituents Imked to a carbon hosphorylation, alkylation or acylation. Ordinarily, protect- 

atom or substituents lhat interrupt a carbon atom chain, and ^ ^^^^^^^ ^ hydrolysis, eUmination or 

unless otherwise defined, substituents 1^^^^^^^^ ethers jhus, simple functional considerations will suf- 

^rT^rm^^^^ fice to the selection of a reversible or an irreversible 

nr'u^n^^^'^u ru n nu'^ protecting group at a given locus on the invention com- 

~NHr/ovI' Tro^NH ao^Nli^^ protecting groups and criteria for their 

"SnVZ ' ~MD 5^„^ ru" Q ^^^^^^5' Eds. "Protective Groups in Organic Synthesis" 2nd edition, 

^t.J^.^Tl^Z^ o^/h^iSTT' T/^T7; in ' Wiley Press, at pps. 10-142, 143-174, 175-223, 224-276, 

7:^?^' -f^^^S^' -S(0)OR^ -^(0)H, halogen ^77-308, 309-^5 and 406^54. 
CN, NO2, and combinations of these substiments where R^ 

is hydrogen or a protecting group. Salts 

The invention compounds herein do not include obvi- Embodiments include salts and complexes of invention 
ously unstable structures, e.g., — O — O — , — O — S — or 60 compounds, including pharmaceutically acceptable or salts 

unsaturated cyclopropyl, unless they are useful as transitory that are relatively non-toxic. The invention compounds may 

intermediates in the preparation of more stable compounds. have one or more moieties that carry at least a partial 

As used herein, "monosaccharide" means a polyhydroxy positive or negative charge in aqueous solutions, typically at 

aldehyde or ketone having the empirical formula (CH20)„ a pH of about 4-10, that can participate in forming a salt, a 
where n is 3, 4, 5, 6 or 7. Monosaccharide includes open- 65 complex, a composition with partial salt and partial complex 

chain and closed-chain forms, but will usually be closed propertiesor other noncovalent interactions, all of which we 

chain forms. Monosaccharide includes hexofuranose and refer to as a "salt(s)". Salts are usually biologically com- 



6,028,183 

7 8 

patibte or phannaceutically acceptable or non-toxic, particu- Salts also include the invention compound salts with one 

larly for mammalian cells. Salts that are biologically toxic or more amino acids. Many amino acids are suitable, espe- 

are optionally used with synthetic intermediates of invention cially the naturally-occurring amino adds found as protein 

compounds. When a water soluble composition is desired, components, although the amino acid typically is one bear- 

monovalent salts are usually preferred. 5 ing a side chain with a basic or acidic group, e.g., lysine. 

Metal salts tj'pically are prepared by reacting the metal . aiginine, histidine or glutamic add, or a neutral group such 

hydroxide with a compound of this invention. Examples of as glycine, serine, threonine, alanine, isoleudne, or leudne. 

metal salts which are optionally prepared in this way are invention compositions include compounds in their 

salts containing Li*, Na"^, and K"^. A less soluble metal salt un-iooized, as well as zwitterionic form, and combinations 

can be precipitated from the solution of a more soluble sah jq stoiochimetric amounts of water as in hydrates, 
by adding a suitable metal compound. Invention salts may 

be formed from acid addition of certain organic acids, such Stereoisomers 

as organic carboxylic acids, and inorganic acids, such as The compounds of the invention include enriched or 

alkylsulfonic adds or hydrogen halide adds, to addic or ^ resolved optical isomers at any or all asymmetric atoms as 

basic centers on invention compounds, such as basic centers are apparent from the depictions. Both racemic and dias- 

on the invention pyrimidine base analogs. Metal salts teromeric mixtures, as well as the individual optical isomers 

include ones containing Na*, li*, K-", Ca"^ or Mg-^. Other isolated or synthesized so as to be substantially free of their 

metal salts may contain aluminum, barium, strontium, enantiomeric or diastereomeric partners, are all within the 

cadmium, bismuth, arsenic or zinc ion. scope of the invention. Chiral centers may be found in 

Salt(s) of invention compounds may comprise a combi- invention compounds at, for example, R^, and R^^. 

nation of appropriate cations such as alkali and alkaline One or more of the following enumerated methods are 

earth metal ions or ammonium and quaternary ammonium us^^j ^ prepare the enantiomerically enriched or pure iso- 

ions with the add anion moiety of the phosphoric add or mere herein. The methods are listed in approximately their 

phosphonic acid group, which may be present in invention order of preference, i.e., one ordinarily should employ 

polymers or monomers. stereospecific synthesis from chriral precursors before chro- 

Salts are produced by standard methods, including dis- matographic resolution before spontaneous crystallization, 

solving free base in an aqueous, -aqueous-alcohol or Stereospecific synthesis is described in the examples, 

aqueous-organic solution containing the selected acid. Methods of this type conveniently are used when the appro- 

optionally followed by evaporating the solution. The free pnate chiral starting material is available and reaction steps 

base is reacted in an organic solution containing the acid, in 30 are chosen do not result in undesired racemization at chiral 

which case the salt usually separates directly or one can sites. One advantage of stereospecific synthesis is that it 

concentrate the solution. does not produce undesired enantiomers that must be 

Suitable amine salts include amines having sufGdcnt removed from the final product, thereby lowering overall 

basicity to form a stable salt, preferably amines of low synthetic yield. In general, those skilled in the art would 

toxicity including trialkyl amines (tripropylamine, 35 understand what starting materials and reaction conditions 

triethylamine, trimethylamine), procaine, dibenzylamine, should be used to obtain the desired enantiomerically 

N-benzyl-betaphenethylamine, ephenamine, N,N'- enriched or pure isomers by stereospecific synthesis. If an 

dibenzylethylenediamine, N-ethylpiperidine, benzylamine unexpected racemization occurs in a method thought to be 

and dicyclohexylamine. stereospecific then one needs only to use one of the foUow- 

Salts include organic sulfonic acid or oiganic cad^oxylic 40 ing separation methods to obtain the desired product, 

acid salts, made for example by addition of the acids to basic If a suitable stereospedfic synthesis cannot be empirically 

centers, typically amines. Exemplary sulfonic acids include designed or determined with routine experimentation then 

aryl sulfonic acids, C^^^ heteroaryl sulfonic adds and those skilled in the art would mm to other methods. One 

Ci_ig alkyl sulfonic acids such as phenyl sulfonic acid, method of general utility is chromatographic resolution of 

a-naphthalene sulfonic acid, p-naphthalene sulfonic acid, 45 enantiomers on chiral chromatography resins. These resins 

(S)-camphorsulfonic acid, methyl (CH3SO3H), ethyl are packed in columns, commonly called Pirkle columns, 

(C2H5SO3H), n-propyl, i-propyl, n-butyl, s-butyl, i-butyl, and are commercially available. The columns contain a 

t-butyl, pentyl and hexyl sulfonic acids. Exemplary organic chiral stationary phase. The racemate is placed in solution 

carboxylic acids include C^,^^ alkyl, C^.^^ aryl carboxylic and loaded onto the column, and thereafter separated by 

adds and heteroaryl carboxylic acids such as acetic, 50 HPLC. See for example. Proceedings Chromatographic 

glycolic, lactic, pyruvic, malonic, glutaric, tartaric, citric. Society— International Symposium on Chiral Separations, 

fumaric, succinic, malic, maleic, oxalic, hydroxymaleic, Sep. 3-4, 1987. Examples of chiral columns that could be 

benzoic, hydroxybenzoic, phenylacetic, cinnamic, salicylic, used to screen for the optimal separation technique would 

nicotinic and 2-phenoxybenzoic. include Diacel Chriacel OD, Regis Pirkle Covalent 

Invention salts include those made from inorganic adds, 55 D-phenylglycine, Regis Pirkle Type lA,Astec Cydobondll, 

e.g., HF, HCI, HBr, HI, H2SO4, H3PO4, Na2C03, K2CO3, Astec Cyclobond III, Serva Chiral D-DL»Daltosil 100, 

CaC03, MgCOj and NaQOg. Suitable anions, which are Bakerbond DNBLeu, Sumipax OA-1000, Merck Cellulose 

optionally present with a cation such a Ca**, Mg**, Li*, Na* Triacetate colunm, Astec Cyclobond I-Beta, or Regis Pirkle 

or K*^, include arsenate, arsenite formate, sorbate, chlorate, Covalent D-Naphthylalanine. Not all of these columns are 

perchlorate, periodate, dichromate, glycodeoxycholate, 60 likely to be effective with every racemic mixture. However, 

cholate, deoxycholate, desoxycholate, taurocholate, those skilled in the art understand that a certain amount of 

taurodeoxycholate, taurolithocholate, tetraborate, nitrate, routine screening may be required to identify the most 

nitrite, sulfite, sulfamate, hyposulfite, bisulfite, effective stationary phase. When using such columns it is 

metabisulfite, thiosulfate, thiocyanate, silicate, metasilicate, desirable to employ embodiments of the compounds of this 

CN~, gluconate, gulcuronate, hippurate, picrate, 65 invention in which the charges are not neutralized, e.g., 

hydrosulfite, hexafluorophosphate, hypochlorite, where acidic functionalities such as carboxyl are not esteri- 

hypochlorate, borate, metaborate, mngstate and urate. fied or amidated. 
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Another method entails converting the enantiomers in the polycycle and is termed the polycyclic substructure. 

mixture to diasteriomers with chiral auxiliaries and then consists of two principal structural features denominated 

separating the conjugates by ordinary column chromatog- — A(T)xi' Group A is a spacer that is used to position the Z 

raphy. This is a very suitable method, particularly when the group(s) and attach it to the remainder of the polycycle. The 
embodiment contains free carboxyl, amino or hydroxyl that s Z group(s) serve as a site for attachment of a detectable label 

will form a salt or covalent bond to a chiral auxiliary. or to enhance the hydrogen bonding of the polycycle to the 

Chirally pure amino acids, organic acids or organosulfonic complementary guanine base. In some embodiments, Z is 

acids arc all worthwhile exploring as chiral auxiliaries, all of capable of performing both functions. For the most part, Z 

which are well known in the art. Salts with such auxiliaries groups capable of hydrogen bonding are useful as sites for 
can be fonned, or they can be covalently (but reversibly) lo covalent bonding to detectable labels, but not all Z groups 

bonded to the functional group. For example, pure D or L that are useful as label-bonding sites are capable of hydro- 

amino acids can be used to amidate the carboxyl group of gen bonding to guanine. Z contains at least one atom other 

embodiments of this invention and then separated by chro- than carbon, typically O, N or S. In any case, the R^ groups 

matography. Enzymatic resolution is another method of possess at least one of these practical utilities. It would be 
potential value. In such methods one prepares covalent is routine to make and test them to determine the best use of 

derivatives of the enantiomers in the racemic mixture, any one embodiment. 

generally lower alkyl esters (for example of carboxyl), and 2 groups capable of base-pairing are believed to hydrogen 

then exposes the derivative to enzymatic cleavage, generally bond with of guanine in a complementary nucleic acid 

hydrolysis. For this method to be successful an enzyme must sequence when incorporated into a polycyclic substrucUire- 
be chosen that is capable of stereospecific cleavage, so it is 20 substituted oligonucleotide. The resulting duplex has greater 

frequently necessary to routinely screen several enzymes. If stability than one containing a native GC pair because the R^ 

esters are to be cleaved, then one selects ia group of esterases, g^up provides an additional point for hydrogen bonding to 

phosphatases, and lipases and determines their activity on j^e complementary guanine base. Thus, these embodiments 

the derivative. Typical esterases are from liver, pancreas or serve as cytosine surrogates for supplemented Watson-Crick 
other animal organs, and include porcine liver esterase. 25 base-pairing. In general, a base-pairing substituent Z is 

If the enatiomeric mixUire separates from solution or a defined functionally as any group that, when taken together 

melt as a conglomerate, i.e., a mixture of enantiomerically- with the remainder of R^ is capable of increasing the 

pure crystals, then the crystals can be mechanically temperature of melting of any of the oligonucleotides 3-9 in 

separated, thereby producing the enantiomerically enriched Table 1 by at least about 2 degrees Centigrade when sub- 
preparation. This method, however, is not practical for large 30 stituted as shown in Table 1. 

scale preparations and is of no value for true racemic jf ^2 ^^^^ ^^^^^^ ^ substimem that is capable of 

compounds. contributing to base pairing or hydrogen bonding then R^ is 

Asymmetric synthesis is another technique for achieving useful at least as a point of attachment for a detectable label, 

enantiomeric enrichment. For example, a chiral protecting Such Z groups need only be reactive with a bifunctional 
group is reacted with the group to be protected and the 35 4;ross-linking agent or with the label direcUy. In some 

reaction mixture allowed lo equilibrate. If the reaction is embodiments, the polycyclic substructure is itself 

enantiomerically specific then the product will be enriched fluorescent, and in these cases it is not necessary to link the 

in that enantiomer. 2 group to a detectable label. In these embodiments the 

Further guidance in the separation of enantiomeric mix- polycyclic substructure is detectable by fluorescent 

turcs can be found, by way of example and not limitation, in emissions, or by adsorption and energy transfer to an emit- 

" Enantiomers, Racemates, and resolutions", Jean Jacques, ting (second) label present on a binding partner in the same 

Andre Collet, and Samuel H. Wilen (Krieger Publishing fashion as is used in EMIT technologies well-known in the 

Company, Malabar, Fla,, 1991, ISBN 0-89464-618-4): Part diagnostics field, 

2, Resolution of Enantiomer Mixture, pages 217-435; more ^^^^^ ^ substituted with from 1 to 3 Z groups. When 

particulariy, section 4, ResoluUon by Du-ect CrystalUzation, zi&z base-pairing hydrogen bonding group then "XI" is 

pages 217-251, section 5, Formation and Separation of preferably 1 or 2, ordinarily 1. Similarly, for reasons of steric 

Diastereomers, pages 251-369, section 6, Crystallization- ^^^^ ^ preferred that only 1 or 2 Z groups are present on 

Induced Asymmetric Transfonnations, pages 369-378; and ^ ^ ^jj^^, ^2 ^ intended to function as a label bonding 

section 7, Experimental Aspects and Art of Resolutions, ^jj^ 

pages 378-435; still more particularly, section 5,1.4, Reso- , u j- . a j *u t 

lution of Alcohols, Transformation of Alcohols into Salt- /° f embodiments he spacer group A and the Z 

Fomiing Derivatives, pages 26S-266. section 5.2.3, Cova- substituent(s) will interact fiinct.onally. i.e changes in 

1 *T^ - c \\ u \ T-ui A Tku 1 group A may have an impact on the physical or chemical 

lent Denvatives of Alcohols, Thiols^ and Phenols, pages ^ i « j • i •« u 

-51-^ '»-»c c 1 4 T> 1 *• CK A ^cn^c\ properties of Z, and vice-versa. For example, it will be 

332-335, section 5.1.1, Resolution of Aads, pages 257-259, / . . < i -h j • *i. * *u * u u 

J ^ 1 • c r^rr, uudcrstood bv those skilled m the art that changes can be 

section 5.1.2, Resoluuon of Bases, pages 259-260, section uu^vi^iv^^u yy f^tT" , , , . 

5.1.3, Resolution of Amino Acids, page 261-263, section ^ spa,^r A that would reduce or mcrea^ the 

5^.1. Covalent Derivatives of Acidsf page 329, section "bihty of Z to hydrogen bond or to react with a labe or 

e r> 1 * J • c A ' 4 -in -5-51 cross-linking agent, A readily apparent mstance of this 

5.2.2, Covalent denvatives of Ammes, pages 330-331, f .° . _ . . 

ern A r> rxijuj v. would bc substitutiou of A With clectrou douors or acccptors 

section 5.2.4, Covalent Denvatives of Aldehydes, Ketones, . . „ ... fr^^*u„A ^« 

and Sulfoxides, pages 33^339, and section 5.2.7, Chro- proxmial to a Z group, which may affect hydroger^ bonding 

matographic B;havior of Covalent Diastereomers, pages ^^^^'r "^^7' ^ T'^P . ^ 

348-354 ^ consider these domams to be functionally and structurally 

discrete taking into account interdomain interactions that 

Compounds of Structure (1)— Polycyclic would be apparent to the ordinary artisan. 

Substructure Spacer A typically contains a backbone chain of 2, 3, 4, 

R^ is a key functionality. It is substituted on the polycycle 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or 16 carbon atoms, any 

depicted in structure (1), less R^. The combination of the 1, 2 or 3 of which are optionally replaced with N, O or S 
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atoms, usually 1 N, 0 or S atom. The backbone chain refers 
to the atoms that connect the Z group(s) to the ring carbon 
atom at the binding site on the polycycle. The number of 
spacer backbone atoms does not include terminal Z group 
atoms. does not include protected amine as described in 
U.S. Pat. No. 5^02,177. 

The spacer A backbone is linear or one or more backbone 
atoms are substituted, which results in branching. 
Ordinarily, when 1 Z group is present then A will contain a 
linear backbone of 2 to 8, usually 2 to 4 atoms. The 
backbone generally is carbon only, bonded by saturated or 
unsaturated bonds. If unsatiu^ated bonds are present, the 
backbone generally will contain 1 to 2 double or triple 
bonds. Preferably, the backbone is saturated. If a heteroatom 
is present in the backbone it typically will be O or S. 
Preferably the heteroatom is 0, and preferably only 
present in the backbone chain. Heteroatoms are used 
replace any of the backbone carbon atoms, but preferably 
used to replace the carbon atom alpha (adjacent) to the 
polycycUc ring. Usually the atom in the spacer chain that is 
bonded to the polycyclic substructure is unsubstituted, e.g., 
— O — y — S — , — NH — or — CHj — , and, in general, the 
next 1, 2 or 3 atoms in the spacer are unsubstituted carbon. 

The spacer A bacld^one is optionally substituted indepen- 
dently with 1, 2 or 3 of the following: C^-Cg alkyl, — OR^, 
=0, — NO2, — N3, — COOR^ — N(R^)2, or — CN groups, 
Ci-Cg alkyl substituted with —OH, =0, — NO2, — N3, 
— COOR^, — N(R^)2, or — CN groups, or any of the fore- 
going in which — CH2 — is replaced with — O — , — NH — 
or — N(Ci-C8 alkyl), wherein R^ is H or a protecting group. 
Certain of these groups may function as Z sites for linking 
to detectable labels, but need not be used for that purpose 
unless desired. In some embodiments these substituents are 
useful in increasing the lipophilicity of the compounds of 
this invention. 

Group Z detectable labels include all of the conventional 
assayable substances used heretofore in labeling oligonucle- 
otides or proteins. Examples are well known and include 
fluorescent moieties such as fluorescein, chemiluminescent 
substances, radioisotopes, chromogens, or enzymes such as 
horseradish peroxidase. For the purposes herein, the residue 
of any bifunctional or multifunctional agent used to 
crosslink the Z group(s) to the A backbone is defined to be 
part of the Z group, and the residue of the detectable label 
is considered also to represent part of Z. 

Group Z also encompasses substituents that are not 
delectable by conventional diagnostic means used in clinical 
chemistry settings (e.g., UV or visible light absorption or 
emission, scintillation or gamma counting, or the like) but 
which are nonetheless capable of reacting with a crosslink- 
ing agent or a detectable label to form a covalent bond. In 
this regard, the Z groups function as intermediates in the 
synthesis of the labelled reagent. Typical Z groups useful for 
this purpose include — NH., -^HO, — ^H, — CO2Y or OY. 
where Y is H, 2-hydroxypyridine, N-hydroxysuccinimide, 
p-nitrbphenyl, acylimidazole, maleimide, trifluoroacetate, 
an imido, a sulfonate, an imine 1,2-cyclohexanedione, gly- 
oxal or an alp ha -halo ketone. Suitable spacers, reactive 
groups and detectable labels have been described, e.g., U.S. 
Pat. Nos. 5,668,266, 5,659,022, 5,646,261, 5,629,153, 
5,525,465 and 5,260,433, WO 88/10264, WO 97/31008, EP 
063 879 Bl, Urdca "NAR" 16:4937-4956 (1988), Prober 
"Science" 238:336-341 (1987). 

Z also is a hydrogen bond donor moiety or a moiety, when 
taken together with the influence of spacer A, has a net 
positive charge of at least about +0.5 at pH 6-8 in aqueous 
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1 ois I * 

sed toi^Mz/O 

Wyaref 
to the ("^Jl 



solutions. Such Z groups are designated R^. In these 
embodiments, R^ is covalently linked to a short spacer A 
having a backbone (otherwise described above) of 2, 3, 4, 5 
or 6 atoms, designated R^. 

The R^ short spacer chain backbone atoms are C atoms 
and optionally one or two atoms independently selected 
from the group consisting of 0, N or S atoms. R^ short 
spacer chain backbones include unbranched and branched 
alkyl that optionally contain one or two independently 
selected O, N or S atoms. Usually R^ is unbranched, i.e. the 
backbone has no hydrocarbon substituents. Any branching, 
if present, will usually consist of a C^-C^ alkyl group, 
usuaUy a methyl or ethyl group, or C^-C^ alkyl substituted 
with —on, =0, — 0(Ci-<:3 alkyl), — CN, N3 or 1, 2, 3 or 
4 halogen atoms. 

Exempl^ — R^*^ — R^ and related structures are 
— BT— (CHA— NR^qNR5)(NR\, including— O— 
(CM,)— NR^C(NR^)(NR^)2, — NH— <CH2) — NR^C 



20 



25 



30 



),— NR^(NR^)(NR^)2, 
'XNR\ and — (CH2)2.5NR'C(NR^)(NR^)2, 

— R^ — CH2— CHR^'— N(R^)5 
_r6_(r7) N(R^)2, -R'^-(CHAN(R^): 
-^CH2);^^(R^)2, ^CH^— 0-(CH2),— N(R% 



N(R^)2,and 



'N(R3)2, 



(40) 



(41) 



where R** is . usually • 



35 



40 



45 



50 



55 



60 



d29 

-NR5 



Q 



(42) 



(43) 



(44) 



(45) 



(46) 



65 



where R^ is usually — O — and R® is usually — CII2 — or 
— CH2CH2— in structures (42)-<46) and adjacent R^ and R® 
are not — 0 — O — , — 0 — S — or ■ 
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is independently — H, — CH3, — CH2CH3, 
— (CH^^ — N(R^^)2 or a protecting group, usually — 
or — CH3, 

or, both together are joined to form a protecting group, 
or, when R^ is — R^(CH2)^(R^)2, one R^ is H, CH3, 
CH2CH3, a protecting group or — (CHj).^— N(R^^)2 
and the other R^ is — H, — CH3, — CH2CH3, 
-(CH2)^-N(R^%, -CH(N[R^^]2)-N(R^% 



(48) 



K3' 

K3- 



10 



15 



(49) 



(50) 



R^^ is — H, — CH3, — CH2CH3, a protecting group, a 
monosaccharide, where the monosaccharide is usually 
linked at the monosaccharide's 1* position and where any 
monosaccharide hydroxy! groups are optionally protected, 
typically an R^* monosaccharide is 2*-deoxyribose, a 
2'-deoxy-2-R^^ -substituted ribose or arabinose such as 
2'-deoxy-2'-fluororibose or 2'-deoxy-2'-fluoroarabinose, or 
the monosaccharide is ribose or arabinose; 

t is 1, 2, 3 or 4, but when R^ is — O — , — S — or 

—NR^— , t is 2,3 or 4; 
V is independently 0, 1 or 2; and 
w is 1 or 2. 

Invention embodiments include R^ moieties having the 
structure — R^^— NHj where R^^ has the structure — R^— 
R*°— , including — RMCHj),— N(R3)2, where R^ is usu- 
ally —0—, , — NH— or — CH2— , R'^° is 
— CHR^'— (CHR^')23--(R^')zi— (CHR^Oz^^HR"— 



20 




N— R»; 



'22~ 

- — S— , qO), — CHR^ or —NR^-— and 



usually when one R^ is — H, — CH3, — CH2CH3, or 
—(Cil^—N(R^\, the other R^ is — H or a protecting 30 
group; 

R^ is independently — or a protecting group; 
R"* is independently — S— , — NR^— , — O— or . 
—CU^—; 

R^ is independently linear alkyl having 1, 2, 3 or 4 carbon 
atoms, linear alkyl having 2, 3 or 4 carbon atoms and 
containing one — CH=CH — , — C^C — or — CHj — 
0 — CH2 — moiety, or R^ is cyclic alkyl having 3, 4 or 
5 carbon atoms, wherein one of the linear alkyl carbon 

. atoms is optionally substituted with a single — CII3, 
— CN, =0, — OH or protected hydroxyl, provided that 
the carbon atoms in any — CH=CH — or — CH2 — 
O — CH2 — moiety are not substituted with =0, — OH 
or protected hydroxyl, and usually R^ is — CHj — , 
— CH2— CH2— , — CH2--CH2— CH2— or — CH2— 
CH2 — CH2~7CH2 — ; 

R® is linear alkyl having 1 or 2 carbon atoms wherein one 
of the linear alkylene carbon atoms is optionally sub- 
stituted with a single — CH3, — CN, =0, — OH or 
protected hydroxyl, or R® is absent and R^ is linked 
directly to the ring in R^ sUiictures (42)-(46), usually 
R® is — CH2— or — CH2— CH2— ; 

R^ is independently —CU^— —CH(CH^, — CH 
(0CH3>— , — CH(OR^)— or — O— , but both are not 

R2^ is ' independently — N— , — N(CH3>-, — CH— , 
— C(CH3)— , but both are not — N(CH3)— ; 

R^° is — H or — N(R^)2, usually — H or — NH2; 

R^^ is the side chain of an amino acid, usually the side 50 
chain of a naturally occurring amino acid, e.g. glycine, 
alanine, valine, isovaline, leucine, threonine, serine, 
lysine or arginine; 

R^^ is independently — H, — CH3, — CH2CH3 or a pro- 
tecting group; 65 

R^^ is H, Cj-C^ alkyl (including — CH3, -<:H2CH3) or 
a protecting group, usually — H or a protecting group; 



usually 0, 1 or 2 R^^ are methyl or ethyl; R^^ independently 
is — H, methyl or ethyl; Z3 is 1, 2 or 3, usually 1; Zl is 0 

or 1, usually 0; and Z2 is 1, 2 or 3, usually 1. In these 
embodiments, any functional groups, e.g., — OH, — NH2, 
25 — COOH, or — ^SH, that are optionally present at R^^ are 
usually protected. These embodiments are useful as inter- 
mediates useful to make monomers for oligonucleotide 
synthesis. 

Other invention embodiments include R^ moieties having 
the structure 



35 



40 



45 



50 



55 



including 



NR5 



-R*— R^'-NH II ^NHR^ 



NR-^ 



including (CH2)i-5NH ^ ^NHR^; 

R»-R29 

3 (CH2)l-3-R^— 

including H t 



NH ^R^— R* 



N 



, including 



"NH(CH2)i.2-R^ 



; and 



(R*)2N N R^'-R* , including 
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-continued 



(R^ 




'(CHa)i-3 — 



-H, alkyl having 1, 2, 3 or 4 carbon atoms or 
optionally protected substituted alkyl having 1, 2, 3, 4, 5 or 
6 carbon atoms including — CH3 and — CH2CH3, and R^^ is 
— H, — NHj or — NHCCHg). Other embodiments are 
— RMCH2)2.8— NH^, -RMCH2)2.8— OR^ or -R^— 
(CH2)2.8— CO^R^. 

Compounds of Structure (1) — Substituent R^ — 
Linker 

One uses R^ linker groups to covalently bond the inven- 
tion base to the selected binding partner, although it vnU be 
understood that this need not be the sole use for the linker 
functionality. Thus, a group present in R^ linkers principally 
serves as the site for covalently bonding the invention base 
to a binding partner, typically by incorporating the invention 
base via the linker residue into a polymeric binding partner 
by grafting or copolymerization. 

R^ linkers also optionally are substituted with groups that 
ordinarily will not participate in binding to the binding 
partner, e.g., halo, azido and protected bydroxyl. Generally, 
such linker groups will contain from 2 to about 50 atoms. If 
it contains a cycle the cyclic functionality typically will be 
an oxygen, sulfur or phosphorus-containing saturated or 
unsaturated heterocycle having a total of about from 5 to 7 
ring atoms and 1 to 3 heteroatoms. For the most part, the 
cycle will be a monosaccharide, typically (i) a hexose, (ii) a 
hexose such as glucose substituted with phosphate, pro- 
tected phosphate, hydrogen phosphonate, a 
phosphoramidate, hydroxy! or protected hydroxyl, (iii) a 
fiiranose or (iv) a furanose substituted with phosphate, 
protected phosphate, hydrogen phosphonate, a 
phosphoramidate, hydroxyl or protected hydroxyl. Typical 
furanose sugars include ribose, 2 -deoxyribose, 2'-deoxy-2'- 
R^^-substituted ribose and their 2' ara isomers. Ordinarily, 
R* is an abasic nucleotide residue or such a residue deriva- 
tized so as to be capable of incorporation into an oligonucle- 
otide. 

Thus, the R^ linker frequently comprises an activated 
group or other group which can react with a polymer or other 
binding partner to be labeled with the polycyclic substruc- 
ture. For example, groups described below that are compat- 
ible with commonly available oligonucleotide synthetic 
chemistries are useful Other examples of reactani groups for 
covalent labeling are well-known from the diagnostic fields 
and have heretofore been used commonly to label proteins 
and oligonucleotide probes, as is more fully discussed 
below. 

In one embodiment, R^ is a bifunctional or multifunc- 
tional organic linker group such as alkyl, alkene, alkyne, 
alkoxyalkyl, alkylthioalkyl, alkoxy, saturated or unsaturated 
heterocycle that is substituted with at least one group 
capable of being crosslinked with or incorporated into a 
polymer, e.g., such groups as hydroxy, amino, carboxyl, 
vinyl, phosphate or phosphonate. U.S. Pat. No. 5,502,177 
describes suitable linker groups. An example of such an R^ 
linker suitable for . oligonucleotide synthesis is protected 
monosaccharides, such as ribofuranose and deoxyribofura- 
nose sugars of structure (5) 



(5) 




where an invention base is Unked to the open valence at the 
1' position, D is hydroxyl, protected hydroxyl or is an 
oligonucleotide coupling group and is independently R^^ 
or an oligonucleotide coupling group, but both are not 
coupling groups. 

In embodiments of the invention where the compound of 

^5 structure (1) is to be used as a monomer in the preparation 
of oligonucleotides, R^ is typically structure (5) where one 
is an oligonucleotide coupling group and D is — OH or 
protected hydroxyl. 
''Coupling group" as used herein means any group suit- 

20 able for generating a phosphodiester linkage or phosphodi- 
ester substitute linkage between nucleotide bases or their 
analogs. These coupling groups are conventional and well- 
known for the preparation of oligonucleotides, and are 
prepared and used in the same fashion here. They are usually 

25 configured as the b anomers as denoted in structure (5) or 
optionally as the alpha anomers. In general, each compound 
comprising structure (5) will contain two coupling groups: D 
or D^, but with only one being a coupling group. The 
coupling groups are used as intermediates in the preparation 

30 of 3',5' 5*3', 5',2' and 2',5' intemucleotidc linkages in accord 
with known methods. 

Suitable coupling groups for phosphodiester linkages or 
phosphodiester substitute linkages containing phosphorus 
include OH, H-phosphonate; (for amidite chemistries) alky- 

35 Iphosphonamidites or phosphoramidites such as 
P-cyanoethylphosphoramidite, N,N-diisopropylamino-p- 
cyanoethoxyphosphine, N,N-diisopropylamino- 
methoxyphosphine, N,N-diethylamino-methoxyphosphine, 
N,N-diethylamino-P-cyanoethoxyphosphine, 

40 N-morpholino-P-cyanoethoxyphosphine, N-morpholino 
methoxyphosphine, bismorpholino-phosphine, N,N- 
dimethylamino-p-cyanoethylmercapto-phosphine, N,N- 
dimethylamino-2,4-dichlorobenzylmercaptophosphine, and 
bis(N,N-diisopropylamino)-phosphine; and (for triester 

45 chemistries) 2-, or 4-chlorophenyl phosphate, 2,4- 
dichlorophenyl phosphate, or 2,4-dibromophenyl phosphate. 
See for example U.S. Pat. Nos. 4,725,677; 4,973,679; 4,997, 
927; 4,415,732; 4,458,066; 5,047,524; 4,959,463; 5,624, 
621; and International Publication Nos. WO 97/14706 and 

50 WO 92/07864. 

For structure (2) embodiments, if is a coupling group 
then D typically will be hydroxyl protected with a group 
suitable tor ensuring that the monomer is added to the 
oligonucleotide rather than dimerizing. Such groups are well 

55 known and include DMT, MMT, FMOC 
(9-fluorenylmethoxycarbonyl), PAC (phenoxyacetyl), a tri- 
alkyl (Cj-Cg alkyl, each alkyl group is independently 
chosen) silyl ether or an alkyl (Cj-Cg alkyl) diaryl (e.g., 
phenyl) silyl ether sucb as TBDMS (t-butyldiphenylsilyl) 

60 and TMS (trimethylsilyl). The opposite will apply when one 
desires to synthesize an oligonucleotide in the opposite 
direction (5'-»3'). Ordinarily in structure (5) compounds, D 
is DMT, D^ is located on the 3' carbon, R^^ is H and the D^ 
and R^^ groups are in the alpha anomer conformation. 

65 As noted above, R^ includes an optionally protected 
monosaccharides of structure (4) and (4A). Usually the 
monosaccharides in structure (4) and (4A) compounds are 
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2'-deoxyribosc, 2'-deoxy-2'-R^^ -substituted ribose, 
2'-deoxy-2*-R^* -substituted arabinose, nbose or arabinose, 
any of which arc optionally protected at sugar all or some 
hydroxyls or at optionally present R^^ functional groups 
such as — OH — SH or — NH2 groups. 

Invention embodiments include compositions of the for- 
mula 




where B is a structure (3), (4) or (4A) base, R^^ is a diene 
or a dieneophile, both as defined in WO 97/14706, R^'' is 
— OR^, a coupling group including — OH, H-phosphonate, 
a phosphoramidite or an optionally protected oligonucle- 
otide having a 3'-terminal group selected from a coupling 
group and — OR^ and any reactive moiety in R^- is option- 
ally protected, R^^ dienes are independently chosen and 
include 2,4-hexadiene and 3,5-hexadiene. One or both R^^ 
are linked to a solid support such as a crosslinked organic 
polymer, polystyrene, TentageF", polyethylene glycol or an 
inorganic oxide such as silica gel, alumina, controlled pore 
glass or a zeolite. These compositions are useful for making 
oligonucleotides containing one or more invention bases. 

Invention embodiments include compositions and their 
isomers of the formula 




N(R5)(R58) N(R*)(R58) 



where R^^ is independently — H, a protecting group or both 
R^"^ together are a dihydroxy protecting group, R^® is — ^H or 
alkyl containing 1, 2, 3 or 4 carbon atoms and B is a structure 
(3), (4) or (4A) base. Suitable R^ at the nitrogen atom 
include — H, FMOC and tBOC (t-butyloxycarbonyl) and 
suitable R^ at the caiboxyl group include — H, t-butyl and 
benzyl, see, e.g., WO 97/14709. These compositions are 
useful for making oligonucleotides containing one or more 
invention bases. 

Protecting groups suitable for use with amine groups that 
may be present at R^ include FMOC and trichloroacetamide. 
Monomers and polymers may contain such protecting 
groups at R^. 

Substituent R^ — Binding Partner 

R^, when functioning as a binding partner, is a substance 
that noD-covalently binds to a target compound. Generally, 
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the target compound is an analyte whose presence is desired 
to be detected. Binding partners are well-known from the 
immimoassay art and include hapten-antibody pairs such as 
those used in drug immunoassays using EMIT or ELISA 

5 technologies. Binding partners are employed analytically in 
enzymology, where the substrate or the enzyme is labeled. 
Binding partners also are known from the ohgonucleotide 
hybridization art, including oligonucleotide-nucleic acid 
binding partners (as in diagnostic probes or therapeutic 
antisense oUgonucleotides) or oligonucleotide-protein bind- 
ing partners (aptamers). In accordance with this invention, 
an invention base is substituted at R^ by any binding partner. 
While the binding partner may be a small molecule such as 
a drug, hapten, substrate or the like, ordinarily it is a 

j5 polymer. 

Compounds of structure (1) wherein R^ is a polymer are 
an important feature of this invention. For the most part, 
when R^ is a polymer an R^ linker group has been subsumed 
into the polymer structure, either as a monomer unit or by 

2Q grafting onto pre-existing polymer. Therefore, when R^ is a 
polymer, the polymer may comprise the residue of a linking 
group derived from a monomer or where the linking group 
differs from the polymer's monomeric subunits. The inven- 
tion base must be covalently linked to the polymer. 

25 The nature of the polymer is not critical. Typical R^ 
polymers include a biopolymer such as an oligonucleotide, 
a protein (including antibodies, enzymes, cell membrane 
proteins, glycoproteins, glycolipids, lipoproteins and 
Ducleoproteins), a peptide, a nucleic acid, or a glycan or 

30 other polysaccharide or carbohydrate. In . certain embodi- 
ments the polymer is an ohgonucleotide in which either or 
both of the sugar or phosphodiester monomer subunits are 
substituted by groups that continue to permit base pairing by 
the invention base analogs but which have other desirable 

35 characteristics that are not shared with native substituents, 
e.g., those which mask the negative charges of the phos- 
phodiester . linkages or replace the phosphodiester hnkage 
with another group. 
The site at which one links the invention base analogs to 

40 a polymer is typically not critical. In general, any reactive 
group on the polymer is satisfactory when one wants to graft 
the polycycle-R^ substructure onto a pre-existing polymer. 
Obviously, the site of the substitution should not be in a 
location in which the polycycle-R^ substructure will inter- 

45 fere \^ith the intended function for the polymer, e.g. enzyme 
active site, antibody CDR, and the like as will be understood 
by the artisan. An amino acid side chain such as that of 
lysine, glutamic acid, serine, asparagine and the like will be 
satisfactory for grafting to protein R^, as ^^'ill alpha amino 

50 groups, provided that the amino acids in question do not 
participate in the binding partner or ligand/substrate inter- 
action involved in the assay in which the labeled protein is 
to be used. One appUes the same reasoning to select a 
binding site or sites on other analytes such as sugars, 

55 glycans, lipids, and the like. For example, the 1* position of 
ribose or deoxyribose is satisfactory as the site of substitu- 
tion of an oligonucleotide by the invention base analogs. 
Suitable sites will be known to the artisan, particularly in 
those instances where the one intends to substitute an 

60 invention base analog for purine or pyrimidine bases, usu- 
ally for cytosine, or for fluorescent labels. 

The degree of polymer substitution by the invention base 
analogs is not critical. One skilled in the art will choose the 
reaction conditions such that the resulting labeled polymer 

65 will be substituted with sufiBcient molar proportion of base 
analog to facilitate its use in the desired analytical, thera- 
peutic or preparative procedure. This is accomplished by 
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preparing the labeled polymeis under a variety of oonven- binding affinity for a given application. In addition, embodi- 

tional conditions, e.g., the time, temperature or duration of ments include short oligonucleotides or oligonucleotide 

the labeling reaction, to yield a matrix of multiply-labeled domains, e.g., having about 2, 3, 4. 5 or 6 Mxd monomeis, 

polymers. These then are screened for suitabiUty in the ^^ere the domain may have low bmdmg afiBmty, but even m 

intended appUcation. Molar ratios of about from 1:1 to 10:1 5 ^is «ise "e useful as mtemiediates to make longer oUgo- 

invention bL to polvmer generally are suitable. Where the n"«l««'ides so as to mcrease aflmity sufficiently to confer a 

I , , ' r Tm of at least about IS** C Generally, invention oligonucle- 

labeled polymer is prepared by monomer incorporatioo^he ^^^^ ^ ^ ^^^^^ ^^^^^ 4' J ^^^^ 

resulung polymer may contain about from 1% to 100% ,5% or less, of monomers that significantly reduce binding 

mvention base jmalog substitution. In this embodiment each -^^ monomeis that decrease the Tm more than 

invention base is considered a monomer unit (even though lO ^^^^ 2° C. per monomer, compared to a corresponding 

the polymer may have been assembled from intermediate unmodified oUgonucleotide. 

synthons containing 2 or more invention bases per synthon). invention embodiments include protected, partially pro- 

Oligonucleotides are polymers containing at least 2 tected and deprotected monomers and polymers including 

covalently linked nucleotides or nucleotide analogs oligonucleotides. Partially protected compounds arise dur- 

(collectively monomers), at least one of which comprises an 15 ing the course of deprotection and they are thus intermedi- 

invention base. In oligonucleotide invention embodiments at ates in the process of preparing deprotected compounds, 

least one invention base is covalently linked to a nucleotide Typically, one would not recover partially deprotected com- 

sugar, and typical invention oligonucleotides will contain pounds. Deprotected compounds have been subject to a 

about 2-75% of the bases as invention base analogs, usually treatment that removes the protecting group(s), although the 

about 5-25%. Small oligonucleotides, e.g., 2-6-mers, that 20 preparation may contain some compounds with unremoved 

serve as synthetic intermediates for larger oligonucleofides protecting groups. Typically, any remaining protecting 

will optionally contain the higher proportions of invention groups that remain after deprotection are present in small 

base analogs, e.g., about 50-75%. Larger oligonucleotides, amounts that may be removeid by suitable purification meth- 

e.g., about 7-21-mers, will generally contain 1, 2, 3, or 4 ods if desired. 

invention bases, occasionally 5 and usually not more than ^5 Invention embodiments include oligonucleotides of struc- 

about 5 invention bases, unless the oligonucleotide is rela- ture (2) where R^^ is oxygen. Invention oligonucleotides, 

tively long, e.g., about 22-50-mer. including those where R^'^ is oxygen, typically contain 2, 3, 

Invention embodiments include polymers and oligonucle- 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 

otides where the invention bases are located on 2, 3 or more 22, 23 or 24 linked monomers usually about 5-21. Such 

adjacent monomers or nucleotide residues, or the invention oligonucleotides optionally contain about 30-100%, typi- 

bases may be located on monomers or nucleotide residues cally about 60-100%, of the linkages as phosphodiester or 

that are separated from each other by about 1, 2, 3, 4, 6, 8, phosphorothioate linkages, or other linkages of similar bind- 

10, 12, 15, 18 or more monomers or nucleotide residues that ing afiSnity. 

do not contain these bases. When a delectable label is linked Invention oligonucleotides include support-bound 

to an invention base at R^, the oligonucleotide may contain oligonucleotides, which are typically used in solid phase 

1, 2 or 3 of these labeled monomers, usiially 1 or 2. Such synthesis and separation applications. Support-bound oligo- 

labelled monomers are often located at the 3' or 5' terminus, nucleotides are typically protected during synthesis, e.g., 

but they may reside at an internal position such as one, two bases, sugar hydroxyls, linkages and functional groups 

or more monomer residues from either terminus. optionally present are protected as needed, e.g., a sugar 

Invention oligonucleotides, which conUin 1, 2, 3 or more hydroxyl group present at R^, an amine group at R^ or a 

invention bases, will typically have suflScient binding afiBn- ^ hydroxyl or amine group at R^. In these embodiments, R^ 

ity for complementary nucleic acid sequences to allow facile is covalently linked to a solid support or R^ is an oligo- 

detection of the duplex or triplex resulting from the base nucleotide linked to a solid support. When one removes 

sequence-specific binding interaction. Typically, an inven- invention oligonucleotides from a support, the protecting 

tion oligonucleotide will have a Tm of at least about 15° C, groups are generally removed at the same time or shortly 

usually at least about 20*" C, when tested under typical in 45 thereafter. 

vitro binding conditions, such as those described herein and Invention embodiments include highly lipophilic poly- 

elsewhere, (Jones, "J Org Chem" [hereafter "JOC" ] mers and oligonucleotides that comprise (i) one or more 

58:2983-91 1993, Froehler, "Tet. Lcll." 34:1003-06 1993). structure (3) bases, usually about 1, 2, 3 or 4, and (ii) 

Complementary nucleic acid means a natural or synthetic lipophilic modifications such that the polymer or oligonucle- 

compound that is capable of forming a hydrogen bonded 50 otide has an octanol:water partition coefficient of about -0.5 

complex in a sequence-specific manner with an invention to about 2.5, typically about 0.0-2.0, usually about 0.2-1.5, 

oUgonucleotide such as a structure (2) oligonucleotide. and a solubility in water of at least 0.001 /ig/mL, usually at 

Complementary nucleic acid base sequences contain no least 0.1 /<g/mL. 

mismatches, while "substantially complementary" base One can use such highly lipophilic polymers and oligo- 

sequences contain only a limited number of mismatches, nucleotides as reagents to stain, detect or visuahze Uving 

e.g., at most about 1 mismatch per about 15-20 bases, ceUs in vitro or in vivo, as described in U.S. Pat. No. 

relative to an invention oUgonucleotide. 5,633360 and in >^pUcation No. PCT US 96/12530. These 

One optionally measures the binding of an invention highly UpophiUc polymers and oUgonucleotides need not be 

oUgonucleotide to a complementary nucleic acid by detect- binding competent for cell staining, detecting or visuaUzing 

ing or measuring a Tm, by detecting the presence of a label appUcatioos and they are optionally labeled using standard 

present on the invention oligonucleotide or on the comple- ^ labels, e.g., radiolabels (^^p, ^^S, ^""C, ^H), fiuorcscent 

mentary nucleic acid (after separating bound invention oU- labels such as fluorescein, Texas Red, rhodamine, BODIPY, 

gonucleotide from unbound invention oligonucleotide), by resorufin or arylsulfonate cyanines and chemiluminescent 

ampUfying nucleic acids containing a region(s) complemen- labels, e.g., acridinium esters. 

tary to an invention oUgonucleotide and so forth. Because of Embodiments of such optionally labeled highly UpophiUc 

this, invention oUgonucleotides optionally include species 65 invention oUgonucleotides include species where (i) at least 

containing one or more modifications that decrease binding about 30%, typically at least about 40%, usually at least 

affinity, while the oUgonucleotide still retains sufficient about 60% (often at least about 80%), of the intemucleotide 
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linkages are non-ionic intemucleotide linkages (typically 
containing a lipophilic moiety at each non-ionic linkage), or 
(ii) ai least about 30%, typically at least about 40% usually 
at least about 60%, of the bases included in said oligonucle- 
otide contain a lipophilic substitution, (iii) at least about ^ 
30%, typically at least about 40% usually at least about 60%, 
of the sugars, usually at the 2' position, included in said 
oligonucleotide contain a lipophilic substitution or (iv) the 
percent non-ionic nucleotide linkage and the percent lipo- 20 
philic bases and the percent lipophilic sugars sum to at least 
about 30%, typically at least about 40% usually at least 
about 60% (or at least about 80%). 

Usually, the invention base analogs and other noninven- 
tion bases that are present in binding-competent oligonucle- 
otides are linked together by an organic moiety that is 
sufficiently flexible to permit the invention base analog(s) to 
hybridize to complementary bases. The linkage may be a 
conventional phosphodiesler linkage in which a nucleotide 20 
analog containing a structure (1) compound, where is 
deoxyribosyl, ribosyl or an analog thereof, which is incor- 
porated into an oligonucleotide by conventional methods. 
Alternatively, other groups are used to replace the phos- 
phodiester linkage or, in some instances, both of the phos- 
phodiester linkage and the sugar group. These replacement 
groups are termed "phosphodiester substitute linkages" for 
the purposes herein. 

Phosphodiester substitute linkages are well-known from 30 
the prior literature. They include for example phospho- 
rodithioates (Marshal, "Science" 259:1564, 1993), phospho- 
rothioates and alkylphosphonates (U.S. Pat. No. 5,212,295, > 
Kibler-Herzog, "Nucleic Adds Research" [hereafter "NAR" 
] 19:2979, 1991; PCT 92/01020; EP 288,163; FIG. 12-1), 
phosphoroamidates (Froehler, "NAR" 16:4831, 1988), 
3'-NH phosphoramidates (Schultz, "NAR" 24:2966, 1996; 
Gryaznov, "J Am Chem Soc" [hereafter "JACS"] 116:3143, 
1994; Chen, "NAR" 22:2661, 1995; Gryaznov, "Proc Natl 40 
Acad Sci" USA 92:5798, 1995), phosphotriesters (Marcus- 
Sekura, "NAR" 15:5749, 1987), boranophosphates (Sood, 
"JACS" 112:9000, 1991), 3'-0-5'-S- phosphorothioates 
(Mag, "NAR" 19:1437, 1991), 3'.S-5'-0-phosphorothioates 
(Kyle, Biochemistry 31:3012, 1992), 3'-CH2-5'-0- 
phosphonates (Heinemann, "NAR" 19:427, 1991), 3'-NH- 
5'-0-phosphonates (Mag, "Tet. Lett," 33:7323, 1992), sul- 
fonates and sulfonamides (Reynolds, "JOC" 57:2983, 1992), 
sulfones (Huie, "JOC' 57:4519, 1992), sulfoxides (Huang, 50 
"JOC* 56:3869, 1991), sulfides (Schneider, "Tet Lett." 
30:335, 1989), sulfamates, ketals and formacetals 
(Matieucci, "JACS" 113:7767, 1991, PCT 92/03385 and 
PCT 90/06110), 3'-thioformacetals (Jones, "JOC* 58:2983, 
1993), 5'-S-thioethers (Kawai, "Nucleosides Nucleotides" 
10:1485, 1991), carbonates (Gait, "J Chem Soc Perkin Trans 
1" 1389, 1979), carbamates (Stirchak "JOC" 52:4202, 
1987), hydroxylamines (Vasseur, "JACS" 114:4006, 1992), 



35 



methylamine (metbylimines) and metbyleneoxy 
(methylimino) (Debart, "Bioorg Med Chem Lett" 2:1479, 
1992) and amino (PCT 91/06855). Also of interest are 
hydrazino and siloxane (U.S. Pat. No. 5,214,134) linkages, 
thionotriester linkages (WO 96/29337) and related synthesis 
methods (WO 97/31009). 

Phosphodiester substitute linkages per se also are known 
for the replacement of the entire phosphoribosyl linkage of 
conventional oligonucleotides. These include for example 
morpholino-carbamatcs (Stirchak, "NAR" 17:6129, 1989), 
peptides (Nielsen et al., "Science" 254:1497, 1991; U.S. Ser. 
Nos. 07/892,902 and 07/894, 397), riboacetal linkages (PCT 
92/10793) and morpholino-based linkages disclosed in U.S. 
Pat. Nos. 5,521,063 and 5,185,144. 

Additional disclosure of phosphodiester substitute link- 
ages is found in U.S. Pat. No. 5386,023, U.S. Pat. No. 
5,489,677, WO 95/18623, WO 94/00467, WO 93/08296, 
WO 92/20822, WO 92/20823, PCT 91/08213, 90/15065, 
91/15500, 92/20702, 92/20822, 92/20823, 89/12060 and 
91/03680; Mertes, "J Med Chem" 12:154, 1969; Mungall, 
"JOC" 42:703, 1977; Wang, "Tet Lett" 32:7385, 1991; 
Stirchak, "NAR" 17:6129, 1989; Hewitt, "Nucleosides and 
Nucleotides" 11:1661, 1992; Van Aerschot, "Agoew Chem 
Int Ed Engl" 34:1338. 1995; and U.S. Pat. Nos. 5,034,506 
and 5,142,047. 

Invention embodiments include oligonucleotides having 
1, 2, 3 or more optionally protected invention bases, 0 to 
about 30 other optionally protected bases, usually 
guanosine, adenine, thymine, cytosine or 5-methylcytosine, 
and at least one modified linkage, e.g., 3' — N(R^^)— O — 5', 
where R^^ is hydrogen, C^.g alkyl, usually — CH3 or — C2H5 
and the terminal atoms are linked to the 3' and 5' carbons of 
adjacent ribose or 2'-deoxy-2'-R^^ substituted ribose sugars. 



(51) 



45 



(R*^2N — ^P=o 





6,028,183 

24 

(52) gen or a protecting group; R'*^ is Cj.g alkyl or a protecting 
group; and R'*^ is — ^H, a counter ion or a hydrolyzable 
moiety sudi as 



.38 R2* 




(53) 




(R^V — P=0 10 r46 ^ ^y^yi containing 1, 2, 3, 4, 5, 6, 7 or 8 caibon atoms. • 

R'"' pairs include ones where one R'*^ is hydrogen and the 
other R'*^ is alkyl containing 1, 2, 3, 4, 5 or 6 carbon atoms 
or substituted alkyl containing 1, 2, 3, 4, 5, 6, 7 or 8 carbon 
atoms, including methyl, ethyl, methoxyethyl and ethoxy- 
15 ethyl. When R'^^is substituted alkyl, it will usually contain 
1, 2, 3 or 4 non-carbon atoms, but may contain additional 
non-carbon atoms, particularly when the non-carbon atoms 
are halogens or when a group is present as a protecting 
group, e.g., R^^ R^, R^' or R'*^ 
20 Structure (2) and (52) oligonucleotides include species 
where one or more R^^ is — F, — 0(CH2)2NHR^, 
— 0(CH2)3NHR^ — 0(CH2)4NHR^ — O(CH2)20CH3, 
— 0(CH2)30CH3, — 0(CH2)20R^ — ©(CHi)^?, — ©(CHj) 
gOR^, or — 0(CH2)3F (see, e.g., Griffey "J Med Chem" 
25 39:5100-5109 1996, Schultze "Cell" 24:2966-2973 1996), 
Such oligonucleotides include oligonucleotides where 1, 2, 
3, 4, 5, 6, 7, 8 or more monomers are substituted with R^^, 
which will optionally comprise one of these substituents and 
the remaining R^^ are dl hydrogea Embodiments also 
30 include optionally protected monomers containing an 
optionally protected invention base for synthesis of 
phosphoramidate-linked oligonucleotides. Oligonucleotides 
containing one or more of these linkages are optionally 
prepared as highly lipophilic oligonucleotides and they are 
35 suitable for cell staining uses, diagnostic uses and for 
antisense applications that optionally rely at least in part on 
an RNase H mechanism. 

r45s ^p=o Invention embodiments include oligonucleotides or 

I monomers described in U.S. Pat. Nos. 5,670,489, 5,667,976, 

iJ^ 40 5,6.523.55, 5,652^56 and 5,212,295 where one or more 

optionally protected invention bases is present, usually 1, 2, 
where R^® independently is O, CH2 or NH; R'^ indepen- ^ or 4. 

dently is hydrogen, C^^ alkyl (methyl, ethyl, n-propyl, Invention embodiments mclude oUgonucleotides having 
i-propyl, n-butyl, i-butyl, t-butyl, etc.), a protecting group or h 2, 3 or more optionally protected invention bases, 0 to 
both R'*^ together with the nitrogen atom to. which they are 45 about 30 other bases (optionally protected) and at least one 
attached form amide linkage, e.g., a compound of structure (2B) where n 

is 0 to about 50, usually about 5-21. Such amide linkages 

/■ ■ y i y J V have been described, e.g., (Haaima "Agnew Chem Int Hd 

r / \ c \g r / \ Engl" 35:1939-1942 1996; Nielsen "Bioconjugate Chem" 

^ \ / * V / ' ^ \ / ' ^0 ^-^"^ 1994). Other amide linkages that are suitable have 

^ ^ ^ ^ ^ ^ been described, e.g., WO 92/20702 and WO 93/24507. 

y's. Embodiments also include optionally protected monomers 

N ^ or S — X containing an optionally protected invention base for syn- 

\ / thesis of amide-linked oligonucleotides. In general, structure 

55 (2B) oligonucleotides will contain only amide linkages, but 
, . «an . „ , they may also comprise a domain of monomers linked by 

or both R^ together are a protecting group, or R^° is alkyl ^on-amide linkages. Suitable and have been 
(Ci-C,2), usually unbranched or br described, e.g., WO 86/05518, WO 92/20702, WO 

1, 2, 3, 4, 5, 6, 7 or 8 carbon atoms, mcluding methyl, ethyl, nmAznn rk2 ' „ 

nVopyl and Lopropvl) or R^ is substituted alkvUQ^^^ ^^^^5^^' ?Pt^°°^lly ^<^°^P^^^f ^ P^P^^3^ ^^^P^^°g 
usIiaUyunbranched or branched once containing!,^ group, a protectmg group or a sohd support. D opUonally 

5, 6, 7 or 8 carbon atoms, with substituents including one, comprises -H a peptide couphng group, a protecting 
two or more^-^, ~C(0)-, -OC(0)-, -C(0)0-, g^^^P' ^ solid support, but D and D are not both a 

OR'*^, SR"*^, C(0)NR^^ , C(0)N(R'*^)2, peptide couplmg group or a soUd support. WO 92/20702. 

—NR'*^-!, _~N(R'^%, halo (e.g., — F, — Q), — CN, — NO2 described activated derivatives of — COjH and — SO3H. 
moieties); R"*^ independently is hydrogen, a protecting 65 The phosphodiester or phosphodiester substitute linkages 
group (or both R^^ together are a protecting group), alkyl herein are used to bond the 2' or 3' carbon atoms of ribose 
(C1-C4 including methyl, ethyl and n-propyl); R'*^ is hydro- or ribose analogs to the 5' carbon atoms of the adjacent 
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nbose or ribose analog. Ordinarily, the linkages in oligo- Embodiments of the oligonucleotides of the invention 
nucleotides are used to bond the 3* atom of the 5* terminal comprise a moiety which is capable of effecting at least one 
oligonucleotide to the 5' carbon atom of the next 3'-adjaccnt covalcnt bond between the oligonucleotide and a nucleic 
nucleotide or its analog. In general, linkages that contain a acid duplex or strand. Multiple covalent bonds can also be 
phosphorus atom will be 3*,5' linkages and not 2' ,5' linkages 5 formed by providing a multiplicity of such crosslinking 
• because such linkages usually confer reduced binding afBn- moieties. The covalent bond is preferably to a base residue 
ity on the oligonucleotide in which they are present. in the target strand, but can also be made with other portions 
Table 1 of U.S. Pat. No. 5,502,177 describes examples of of the target, including the saccharide or phosphodicster. 
suitable phosphodiester substitute linkages for use with the Preferred crosslioking moieties include acylating and alky- 
invention base analogs. The starting materials in Table 1, or lo lating agents, and, in particular, those positioned relative to 
those used to prepare the starting materials of Table 1, the sequence specificity-conferring portion so as to permit 
generally possess structure (1) in which is ribose, reaction with the target location in the strand. Exemplary 
2*-deoxyribose, a ribose analog or a 2'-deoxyribose analog crosslinking moieties are disclosed and claimed in PCT 
comprising a 5' hydroxyl group and a 3' or 2' hydroxyl 91/03680. See also Praseuth ("Proc Natl Acad Sci" 8:1349, 
group, prepared as described herein or in the citations, with 15 1988), Fedorova ("FEES" 228:273, 1988), Meyer ("JACS" 
an invention base analog(s) being substimted for the bases 111:8517, 1989), Lee ("Biochemistry" 27:3197, 1988), 
used in the citations. The reactions are repeated or ganged Home ("JACS" 112:2435, 1990), Shaw ("JACS" 113:7765, 
with phosphodiester or other linkages in order lo produce 1991). 

trimers; tetramers, pentamers or larger oligonucleotides, Invention embodiments include monomers and oligo- 

including ones up to about 100 bases. 20 nucleotides containing 1, 2, 3 or more invention bases and 

The oligonucleotides of this invention contain naturally a 5' hydroxyl group protected with a base labile protecting 

occurring nucleotides or derivatives thereof. In some oligo- group, including a dansylethoxycarbonyl group, which has 

nucleotide embodimenls the companion nucleotide residues been dcsc:ribcd, e.g., U.S. Pal. No. 5,631,362. Such embodi- 

conlain pyrimidine nucleotides substituted at the 5 position ments optionally include additional protecting groups, 

with a carbon atom which is distally Pi bonded to another 25 Invention embodiments include monomers and oligo- 

atom as for instance 1-alkenyl, l-alkjmyl, heteroaromatic nucleotides containing 1, 2, 3 or more invention bases and 

and 1-alkynyl-heteroaromatic groups such as 5-(l- an invention base or a non-invention base having an exo- 

propynyl)-cytosine and -uridine nucleotides (see PCT Pub- cyclic nitrogen atom, where the nitrogen atom is protected 

lication No. WO 93A0820 and U.S. Pat. No. 5,594,121). with a protecting group as previously described, e.g., speci- 

Other analogs of native bases for use herein include alky- 30 fication and claims 1, 2, 3, 4, 5, 6, 7 and 8 of U.S. Pat. No. 

lated purines or pyrimidines, acylated purines or 5,623,068. Such embodiments optionally include additional 

pyrimidines, or other analogs of purine or pyrimidine bases protecting groups. 

and their aza and deaza analogs. These include, for example Invention embodiments include optionally protected 

N'*,N'*-ethanocytosine, 7-deazaxanthosine , monomers and optionally protected oligonucleotides con- 

7-deazaguanosine, 8 -oxo-N*-methyladenine, 35 taining 1, 2, 3 or more invention bases wherein the compo- 

4- acelylcytosine, 5-(carboxyhydroxylmethyl) uracil, silions possess N-branching, which has been described, e.g., 

5- fluorouracil, 5 -bromouracil, U.S. Pat. No, 5,623,049. 

5-carboxymethylaminomethyl-2-thiouracil, Invention embodiments include "hybrid" 

5-carboxymethylaminomethyl uracil, inosine, oligonucleotides, which contain 1, 2, 3 or more invention 

N*^-isopen ten yl -adeni ne, 1 -meih yl adenine, 40 bases and 2' modifications in one or two regions or domains 

2-methylguanine, 5-methylcytosine, N^-methyladenine, that comprise adjacent linked monomers, typically about 

7-methylguanine, 5-methylaminomethyl uracil, 5-methoxy 2-8 linked monomers, usually about 2-3. One domain 

aminomethyl-2-thiouracil, 5-methoxyuracil, pseudouracil, contains 2' modifications, while hydrogen is linked to the 

5- methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-(l- remaining monomers, typically about 4-10 adjacent linked 
propynyl)-4-thiouracil, 5-(l-propynyl)-2-thiouracil, 5-(l- 45 monomers, usually about 6-8. Such oligonucleotides con- 
propynyl)-2-thiocytosine, 2-thiothymidine, and 2,6- lain at least one domain that is competent to serve as a 
diaminopurine. In addition to these base analogs, one can RNase H substrate and comprises hydrogen at each 2* 
conveniently incorporate into the invention oHgonucleotides position and phosphodiester, phosphorothioate or phospho- 
other base analogs, including pyrimidine analogs including rodithioate 3',5' linkages. The other domain(s) contain a 2* 

6- azacytosine, 6-azathymidine, 5-trifluoromethyluracil or 50 modification(s), such as — O — (€112)2^ or — O — (CH^2 — 
other bases previously described, see, e.g., bases, monomers 0 — CH3, that enhances binding affinity or nuclease stability, 
or oligonucleotides described in WO 92/02258, WO The 2'-modified domain(s) is usually not an efficient RNase 
97/32888 and U.S. Pat. No. 5,614,617. H substrate. The bases in hybrid oligonucleotides are the 

Preferred bases include adenine, guanine, thymine, uracil, typical purines and pyrimidines found in nucleic acids (G, A, 

cytosine, 5-methylcytosine, 5-(l-propynyl)uracil, 5-(l- 55 T, C or U) or their analogs, which one finds in sonae 

propynyl)cytosine and 5-(l-butynyl)uracil, 5-(l-butynyl) oligonucleotide analogs (e.g., U.S. Pat. Nos. 5,484,908, 

cytosine. 5,594,121 and 5,502,177, International Publication No. WO 

Invention embodiments include the protected derivatives 93/10820). Intermediates used to prepare hybrid oligonucle- 

of native bases, their analogs and optionally protected otides will typically contain appropriately protected deriva- 

monomer synthons containing such bases, which one would 60 lives of any bases. 

typically use as intermediates to prepare invention oligo- Oligonucleotides of inverted polarity also fall within the 

nucleotides (see, e.g.. International Ihiblication No. WO scope of this invention. "Inverted polarity" meanis that the 

96/37504, U.S. Pat. No. 5,614,622 Iyer et al., "Nucleosides oligonucleotide contains tandem sequences which have 

& Nucleotides", 14:1349-57, 1995, Uhlmann et al., "Chem opposite polarity, i.e., one having polarity 5'-»3' followed by 

Revs", 90:543-587, 1990, S. Agrawal, Ed. Methods in 65 another with polarity 3'-»'5', or vice versa. These sequences 

Molecular Biology, Vol. 20, Protocols for Oligonucleotides thus are joined by linkages which can be thought of as 

and Analogs, pp. 165-189, Humana Press, 1993) effectively a 3* — ^3' intemucleoside junction (however the 
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linkage is accomplished), or effectively a 5' — ^5' internucleo- 
side junction. For a further descriptioa of suitable methods 
for making such oligonucleotides sec, e.g., WO 93/10820. 
Compositions of "parallel-stranded DNA" designed to fonm 
hairpins secured with AT linkages using either a 3-3' inver- 5 
sion or a 5 '-5' inversion have been synthesized by Van de 
Sande, "Science" 241:551, 1988. In addition, oligonucle- 
otides which contain 3'-3' linkages have been described 
(Home, op cit; and Froehler, "Biochemistry" 31:1603, 
1992). These oligonucleotides are useful as binding partners 10 
for double stranded nucleic acids to form triple helix (or 
U-iplex) complexes as a means for detecting complementary 
sequences and inhibiting of target gene expression (PCT 
89/05769 and 91/09321). 

Invention embodiments include polymers such as oligo- 15 
nucleotides containing 1, 2, 3 or more invention bases, 
where the polymer is a component of a complex or compo- 
sition useful for transCectiog the polymer into a cell in viu^o 
or in vivo. These complexes or compositions are referred to 
herein as "traosfection complexes". Such transfection com- 20 
plexes optionally comprise one or more lipids, e.g., cationic 
or anionic lipids, as well as other lipophilic compoimds such 
as cholesterol or colipids such as DOPE. The complexes 
optionally comprise an uncharged or a charged polymer such 
as polyethylene glycol, polybrene or a peptide, e.g., polyl- 25 
ysine. The complexes optionally comprise unilamellar or 
multilamellar liposomes or vesicles. 

As used herein, any oompound(s), reagent(s) or treatment 
that enhances delivery of an invention oligonucleotide into 
a cell or tissue is a "permeation enhancing agent." Perme- 30 
ation enhancing agents are well known and are usually 
present as transfection complexes containing 
oligonucleotides, e.g., unilamellar or multilamellar lipo- 
somes or vesicles. One uses permeation enhancing agents to 
prepare transfection complexes containing invention oligo- 35 
nucleotides. The permeation enhancing agent are used in 
essentially the same manner as is used to prepared transfec- 
tion complexes containing nucleic acids, non-invention oli- 
gonucleotides or polymers into ceUs or tissues. 

Invention traasfection complexes optionally comprise an 40 
additional non-invention polymer(s), e.g., a nucleic acid 
expression vector(s), a therapeutic agent(s) (e.g., amphot- 
ericin B) or a peptide(s). 

Invention transfection complexes comprising a lipid may 
be, as defined herein, "large", i.e., complexes having a 45 
maximum average dimension of at least about 200 nm in 
length or diameter, typically having an average length or 
diameter of about 200-400 nm, occasionally having an 
average length or diameter of about 400-800 nm. Transfec- 
tion complexes comprising a lipid may be "small", i.e., 50 
. complexes having a maximum average dimension of about 
15-200 nm in length or diameter, e.g., an average dimension 
of about 60-120 nm. Transtection complexes may comprise 
a mixmre of large and small complexes in about equal 
proportions or they may comprise a preponderance of smaU 55 
or large transfection complexes, e.g., at least about 55%, or 
at least about 60-80% of the complexes in a given prepa- 
ration are large or small. 

Transfection complexes comprising a lipid optionally 
include a stabilizing compound(s), e.g., a monosaccharide or 60 
a disaccharide such as glucose, trehalose, maltose or 
sucrose, that is present at the outer surface, or at the inner 
surface or at both surfaces of transfection complexes. Work- 
ers have described suitable compounds such as lipids, colip- 
ids and stabilizing compounds for making transfection 65 
complexes, methods to size the complexes and methods to 
use the complexes to deliver a polymer or monomer into the 



cytoplasm of a cell in vitro or in vivo, e.g., U.S. Pat. Nos. 
5,635,491, 5,633,156, 5,631,018. 5,629,184, 5,627,159, 
5,626,867, 5,620,689, 5,595,756, 5,543,152, 5,478,860, 
5,459,127, 5,264,618, 5,223,263, 5,194,654, 4,981,692, 
5,077,056, 4,522,803, 4,588,578, 4,885,172, 4,975,282, 
5,059,421, 5,000,958, 5,030,453 and 5,047,245, WO 
96/01840, WO 96/01841, WO 97/30969, WO 97/30732, 
Uwis, "Proc Nad Acad Sci" 93:3176-3181, 1996, U.S. 
patent application Ser. No. 08/672,206. 

Invention transfection complexes useful for delivering the 
invention oligonucleotides into cell cytoplasm also include 
complexes comprising inorganic compounds, e.g., calcium 
phosphate. 

The R^^ moiety is linked to invention ohgonucleotides or 
monomers useful for oligonucleotide synthesis. R^^ is usu- 
ally linked to the 2' or the 3' position of furanose sugars. 
When R^^ is a nuclease stability enhancing moiety, a broad 
range of structures may be used to increase stability of 
oligodeoxynucleotides or oligoribonucleotides containing 
phosphodiester linkages. Oligonucleotides having moieties 
other than hydrogen or hydroxyl at the 2' position usually 
confer increased nuclease stability or increased binding 
afGnity on the oligonucleotide relative to hydrogen or 
hydroxyl. Enhanced nuclease stability is conveniently mea- 
sured using dimers or short oligonucleotides as essentially 
described, e.g., WO 92/05186. One or two R^^ moieties at 
the 3' and 5' terminal monomers in an oligonucleotide will 
increase stability of the oligonucleotide to 3'- and 
5 -exo nucleases. One may increase an oligonucleotide's 
stability to endonucleases by incorporating R^^ moieties that 
increase nuclease stability at internal monomer positions. 

In addition to increasing nuclease stability, some R^^ 
moieties enhance binding affinity of the oligonucleotide to 
which they are linked. These moieties include fluorine and 
short unbranched optionally substituted 0-alkyl groups con- 
taining about 2-8 carbon atoms, where the alkyl group is 
optionally substituted at the distal carbon atom with, e.g. 
— ^F, — OH or — NH2, and optionally substituted with an 
ether at an internal carbon, e.g., 
— 0— (CH2)2— O— (CH2)2F, — 0— (CHj)20CH3 or 
— O— (CH2)2— O— (CH2)2CH3. 

When R^^ is a nuclease stability enhancing moiety, it will 
typically comprise — CH3, =0 — NHR^ or a chain having 
a backbone containing 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 
linked atoms, wherein the chain usually comprises carbon 
(C) atoms and optionally 1, 2, 3 or 4 atoms independently 
selected from the group consisting of oxygen (O), nitrogen 
(N) and sulfur (S) atoms. The chain is usually linked to the 
sugar carbon atom through — 0^ — , — S — , --S(O) — , S(0) 
(0>-, — CHj— , ^CH— or — NH— . The chain is 
branched or imbranched, often it is unbranched or has only 
limited branching, e.g., — CH3, — CHjOH, — C2H5 or 
— C2H4OH. The R^-* chain may comprise a C2.12 ^^cyl 
group or a C2.20 substituted alkyl group. If R^^ is a substi- 
tuted alkyl group, usually only 1, 2 or 3 carbon atoms are 
substituted. Suitable substituents include those described 
above for substituted alkyl groups, e.g., halogen (usually 
fluorine), — O — or — OR^. 

Invention embodiments include oligonucleotides and 
monomers where one or more monomers comprise 
2*-deoxyribose, ribose or arabinose sugars, or their carbocy- 
clic analogs, having one or more 2' R^^ modification such as, 
— O— alkvl, ^NH— alkyl, alkyl, — OR^ ~NHR^ 
— SR^ -halo (usually — F), — R^-alkyl or — R44- 
substituted alkyl wherein the alkyl or substituted alkyl group 
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usually comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 caiboo the bydroxyl, amino and any other labile groups of are 
atoms, usually about 2-6 carbon atoms, where R^^ is inde* protected as required. In another approach, R' of the starting 
pcndcntly — O — , — S — , — NH — or — CHj— , usuaUy material is H or a protecting group and one adds the linker 
— O— . The oligonucleotide linkages connecting such after the ring closure steps set forth in the schemes, in the 
monomers are 3',5' Unkages. The alkyl groups at the 2' 5 fashion as has heretofore been employed in the alky- 
position typicaUy have 1, 2, 3, 4, 5. 6, 7, 8, 9, 10, 11 or 12 lotion of pyrimidine bases intended for use as antiviral 
carbon atoms which are optionaUy present as methylene compounds. 

groups (— CH,— ) and optionaUy have 1, 2, 3 or 4 ether embodiments in which R is a binding partner 

(_0_) or other substitutions, e.g., 0-alkoxyalkyl (Q-C^^ such as a polymer the compounds of this invenUon are 

alkyl), -0-(CHJ,8-CH^C0,H. -©-(CHJ^.s- lo synthesized by covalently cr(»slmkmg the Imker modified 

rH / n } %u \ n rrw n polycyclic base of this mvenuon to the bmdmg partner, or 

rrus ^'Z^'^^s^^lTntrV^ " (^^ere the binding partner is a polymer) by incorporating 

n PH n« nrrHPHWrHr^ into the polymer a monomer unit which is substimtcd by an 

"2:7. ' .? u n invenUon polycyclic base. 

~?.^'^Z''"'^t '"i:^s^'^°.^'^f\^^^ . 1° '•'-^ fi'^' embodiment (polymer grafting) a 

5 • 1 ? " ' " R^-substituted polycycUc substniciiie is covalenUy bonded 

usually 1 or 2. conventional cross-linking agent to the polymer. 

The alkyl groups at the 2 position also include substituted Most convcniendy, structure (1) compounds in which is 

alkyl, e.g., — 0 — alkylamino, — S-alkylamino, — NH- hydroxyl- or amino-substituted alkyl are readily cross- 

alkylamino, or their protected derivatives, wherein the alkyl linked to reactive groups present in the molecule to be 

group contains 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 carbon 20 labeled as noted above. Exemplary cross-linking agents 

atoms, which are all optionally present as methylene carbons include succinic anhydride, N-hydroxysuccinimide esters 

( — CH2 — ). Usually the alkyl group or substituted alkyl (biotin NHS ester), epoxides, isothiocyanates, imidates, 

group at R^^ will contain 1,2, 3, 4, 5, 6, 7 or 8 carbon atoms. DCC (dicclohexylcarbodiimide), EDC (l-ethyl-3-[3- 

Such groups include —O-methyl,—0-ethyl,—0-n-propyl, (dimethylamino)propyl] carbodiimide), BOP, and 

— 0-allyl, — <CH2), .OH, including — O^CH2)20H, 25 glutaraldehyde, see, e.g., EP 0 063 879, Ruth "J Org Chem" 

--0^CH2)2F,^-^"H2CHF2, -(>U:H^F3, -^O- 43:2870, 1978, Bergstrom, JACS 100:8106, 1978, Bigge, 

(CH2), eOCH3, including --0^CH2)20CH3, -~0- J^CS 102:2033, 1980. Cyanogen bromide activated carbo- 

(CH,)^6cH2CH3, -0^CH,),0CH,CH,0H, -~0- ^y^ff^ ^'V^f; T^^ cross-hnking agents are used to 

CH2)20CH2CH2F, _0_\CH2).NHR^ ^^^^ the R^ -substituted polycycie to the polymer in the 

—TLl/rw N Mup5 >^_/ru \ MHU5^ cuUCU V 30 ^^shion as polymers heretofore have been cross-hnked 

rPH \ F 'c^Tu^F^ ^g" ^y^^^^^ amino-bearing moieties. An 

~n H ^ p ^'^ n rru\ rn fr^T ^ 1 r^?' ^ ^^^"^P^^ ^ ^^^^^^^^ "^^^^^^ described per se in Cook et 

■"^tTT^ 4^ , ' ~kttT^ u ^r^*- xTT^ ^^^-"^ i' al., U.S. Pat, No. 5,218,105. This method is readily applied 

-f ^"i^^"^^; ^^^^ Wh^: R^rS°~ substituted, the foUowing 

(^H ) 01?'°''' iJS C H) F d exemplary synthetic approaches are suitable for ciosslinking 

U4§Ws-{0^CH^.]A whe% s is 2, 3, 4, 5, 6:7 "hIJh ^R-'tm^^ ) OR-^ CH NHaN* 

or 8. R" moieties do not include unstable species at the 2' h\!_ri5 — C(N H^)— OK — CH^NHC^N 

position, e.g., — O — O — or — S — O— . 40 HJ R , ,n 

Other suLble R- at the 2' or 3' position of optionally ^^^'^^'^'^ "7^^^=^^™^^^^^ 

protected invention monomers or optionally protected — CH^NH^+R -epoxide-*— CH^NHCHjCHCOH)— 

invention oligonucleotides include — ^H, — ^^H, — NHOR^^, ^ , n 

=NH, — N3, — CN, — CH^CN, — CHCI2, — CFH2, where R'° is an organic moiety optionally contammg a 

__Cr II =C1I CF^ CIl2CIl=CIl2, =0, =CIIC hapten or other detectable moiety such as biotin or avadin. 

(0)0R" (including =CHC(0)0CH3 and =CHC(0) 1° the second embodiment (copolymerization) the R' 

OCH2CH3), — 0C(S)0C6Hs, t-butyldimethylsilyl ether, tri- ^^^^ is capable of functioning as a monomer for copoly- 

isopropylsilyl ether, a 2' amino group protected by an merization with other monomer units that may or may not be 

N-phthalovl protecting group or a fluorescent label, R" is substituted with the polycyclic substructure of structure (1). 

indepcndenUy R^ alkyl having 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 50 In some embodiments, the R' Unker is an alkyl carboxylate, 

carbon atoms or substituted alkyl having 1, 2, 3, 4, 5, 6, 7, an alkyl amine or an amino acid for incorporation into 

8, 9 or 10 carbon atoms where the cariDon atoms in R^^ are Peptides by in vitro methods. However, in the typical 

optionally all present as methylene (— CH2— ) or substituted embodiment the R polymeric binding partner is an oligo- 

methylene (— CH(substitution)-) moieties. Workers have nucleotide as depicted in structure (2), and these conve- 

described suitable 2' modified monomers and 55 mently are made by copolymerizaUon with a nucleoUde 

oUgonucleotides, eig., WO 97/14706, WO 96/05298, WO analog substituted with the polycyclic substructure. The 

93/13121, and WO 91/06556 and U. S. Pat. Nos. 5,631,360, starting matenals for the synthesis of sUiicture (2) generally 

5,627,053, 5,623,065, 5,576,302 and 5,578,718. are compounds of strucmre (1) in which R is nbose or 

deoxyribose substituted with appropriate protecting and 
Methods for Synthesis coupling groups further described above. Suitable starting 
The compounds of structure (1) where R^ is a linker or H monomers for ohgonucleotides having phosphodiester sub- 
are prepared by methods known in the art per se and as more stitute linkages are set forth in Table 1, and they are prepared 
fully described below. Typically, such compounds are pre- in the same fashion as other nucleotide analog bases 
pared from a 5-bromouracil, 5-bromouridin-l-yl, described in the literature. Similarly, conventional phos- 
5-iodouracil, 5-iodouridin-l-yl substituted derivative as 65 phodiester or phosphorothioate linkages are prepared from 
shown in the synthetic schemes below and subsequent nucleotide analogs containing coupling groups D and 
reactions close the polycyclic ring. In these embodiments described above. The compounds of this invention then are 
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incorporated into the desired oligonucleotide by known 
methods of in vitro synthesis described in the referenced 
methods. Alternatively, polycylic substructure -substituted 
nucleotide triphosphates may be incorporated into oligo- 
nucleotides as cytosine analogs by DNA polymerase or 
reverse transcriptase in vivo or in vitro (see Ward, U.S. Pat. 
No. 4,711,955). In this case, is ribosyl or deoxribosyl 
triphosphate, or a triphosphorylated analog thereof recog- 
nized by DNA polymerase or reverse transcriptase which is 
then incorporated into an oligonucleotide by template- 
directed transcription. 

Synthesis of oligonucleotides containing 3 or more nucle- 
otide residues is optionally accomplished using synthons 
such as dimers (which contain substitute or diester linkages) 
or trimers, each carrying a terminal coupUng group suitable 
for use with amidite, H-phosphonate or triester chemistries. 
The synthon is then linked to the oligonucleotide or another 
synthon via a phosphodiesler or phosphorous-containing 
phosphodiester substitute linkage. 

Oligonucleotides containing phosphorothioate, meth- 20 
ylphosphonate and phosphodiester linkages are readily pre- 
pared by solid-phase oligonucleotide synthesis techniques. A 
description of modiGcalions useful in the synthesis of phos- 
phorothioate linked oligonucleotides are found, for example, 



10 



15 



in £P 288,163, wherein the oxidation step in solid phase 
automated synthesis using amidite chemistry can be inde- 
pendently adjusted at any step to obtain the phosphorothio- 
ate. An alternate method for synthesis of oUgonucleotides 
with phosphorothioate linkages, via hydrogen phosphonate 
chemistry, has also been described (Froehler "NAR" 
14:5399, 1986). Sulfurization is accomplished using 
reagents such as tetraethylthiuram disulfide, dibenzoyl 
tetrasulfide, thiophosphoric acid disulfide, 3H-1,2- 
benzodithiol-3-one 1,1 -dioxide and the like as described 
(Vu, "Tet Lett" 26:3005, 1991; Rao, "Tet Lett" 33:4839, 
1992; U.S. Pat. No. 5,151,510; Iyer, "JOC" 55:4693, 1990; 
Dahl, "Sulfur Reports" 11:167, 1991). These sulfurization 
reagents are used with either phosphoramidite or hydrogen- 
phosphonate chemistries. Synthesis of phosphorothioate oU- 
gonucleotides having controlled stereochemistry is used to 
generate stereoregular invention oligonucleotides as 
described (EP 506,242). Thionomethyl phosphonate is pre- 
pared with methylphosphonamidite followed by sulfuriza- 
tion as described (Roelen, "Tet Lett" 33:2357, 1992) or with 
the sulfurization reagents described above. 

One prepares various structure (1) compounds as 
described below and in the examples. 



Scheme A 




PhaP, Ca4 CHzQa, ^ ^ 



Step 1 



(100) 




'step 2 



HN R^'H 





R^^H 



(102) 



N 



(103) 



step 3 

Ph3P, DEAD 
CH2q2,R^H 
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Scheme A depicts preparation of structure (1) conipounds 
where R'*'' is — O— or — S— ; and R^— OH is R^ which 
has a free hydroxyl. 

Step one is conducted by beating the reaction mixture 
containing (100) in an organic solvent to at least about 50* 
C, generally for about 3-4 hours. Step two is performed by 20 
reacting (102) in an organic solvent for about 6-48 hours, 
generally for about 10-20 hours at about 15*" C. to reflux 
temperature, generally at about 18-25° C. The R^ moiety is 
linked under Mitsunobu conditions to (103) in step 3 by 
reacting about 1-1.5 equivalents of the alcohol, i.e., — 
OH, using an activating agent as a leaving group, such as 
triphenylphosphine (Ph^P) and diethyl diazocarboxylate 
(DEAD) to obtain (104). In step 4, (105) is prepared by 
forming the ring containing R"*^ by (1) incubating (104) in 
a polar organic solvent, typically an alkanol containing 1, 2, 
3, 4, 5 or 6 carbon atoms, such as methanol or ethanol, 
containing a mild base such as NH3, TEA (triethylamine), 
DBU (l,8-dia2abicyclo[5.4.0]undec-7-ene) or (2) refluxing 
in ethanol in the presence of potassium fluoride. Generally 
(104) is incubated in saturated NH3 in methanol for about 
2-3 days to afford (105). 

The use of (102) in which one R"*^ is — O — and the other 
is — S — will produce a mixture. One optionally isolates 
each (104) component or one optionally converts the (104) 
mixture to a (105) mixture. One optionally separates the 
mixtures at any convenient point by standard methods, e.g., 
silica gel chromatography, or HPLC. 

When one prepares compounds according to scheme A 
and R^ is a monosaccharide, e.g., 2'-deoxyribose, 2'-deoxy- 
2'-R^^-substituted ribose or arabinose, the sugar hydroxyls 
in (100) are usually protected, generally using base-labile 
protecting groups, e.g., acetate, proprionate, butyrate, phe- 
noxyacetyl. The step 4 reaction imder basic conditions 
removes the protecting groups, which facilitates the ring 
formation reaction resulting in (105). When R^ is not a 
monosaccharide, it is generally a protecting group or an 
optionally protected linker, e.g., — (R"*^)*^ — R , where R"*® 
is independently — CHj — , — O — , — -S — , — ^NH — or 
— C(0)h-, X is 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 and R^^ usually 
is a functional group suitable for linking to a solid support, 
monomer or polymer, including — OR , — SR^, — N(R^)2, 
— 0(0)N(R% — NR^C(0)H, -<:(0)H and — C(0)OR^ 
R^ is independently — H, a protecting group, or both R 
together are a protecting group. GeneraUy the R^ group that 
is adjacent to R*^ is ethylene ( — CHj— il^Hj — ) and gener- 
aUy only 0, 1 or 2 are moieties other than — CH,— , e.g., 
one R'*^ is — O — or — C(0) — and the remaining R are all 
— CH2 — . Exemplary compounds where R^ is a monosac- 
charide have the strucmres (130)-(134), which correspond 
to compounds (lOOHlOl) and (103)h(105) respectively. 
For compounds (130)-(132), R^ at sugar hydroxyls is typi- 
cally a base labile protecting group such as acetyl and R^^ 
is typically — O — . During conversion of (133) to (134), the 
R^ protecting group is removed from the sugar under ring 
closure reaction conditions, which facilitates ring closure. 
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Structure (134) compounds may be converted to mono- 
mers suitable for oligonucleotide synthesis. Such monomers 
typically have a coupling group at the 3* position, e.g., 
H-phosphonate, or a phosphoramidite such as a 
P-cyanoethylphosphoramidite, N,N-diisopropylamino-P- 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. The 5' position will 
contain a DMT-0 — or other protecting group suitable for 
oligonucleotide synthesis. The monomers may alternatively 
have a coupling group at the 5' position and a protecting 30 
group at the 3' position. The protecting and coupling groups 
are added sequentially. Scheme B shows synthesis of struc- 
ture (1) compounds where in is — CHj — . In scheme 
B, Y is 1, 2, 3 or 4; is independently — CH^— , 
— C(0)— , -<CH2)2— 0-<CH2)2— , — (CHa)^— NR^— 
(CH^)^-, -(CH2)2-S-(CH2)2- ^H(N(R^)^)-, 
^CH(COOR^)— or — CCCHg)— , — C(C.H5>- but adja- 
cent moieties are not G(0), usually R^° is — CHj — ; TFA is 
trifluoroacetate; and CBZ is carboxybenzoyl; and (96) is 40 
HteC(R^°)^— NH-TFA. Protecting groups present in R^° 
are stable to the reaction conditions shown. 



Scheme B 
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(96) 



Pd(PPh3)4, 

TEA 



60 



65 



36 

-continued 




aXR^VWTFA 



10 (107) l«2d£H^£^ (108) 
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Compound (97) is converted to (98) by hydrogenation 
reaction in alcohol, usually methanol or ethanol, at about 
IS-IS"" C. for about 10-24 hours, usuaUy about 12-18 
hours. The catalyst is removed and the filtrate is concen- 
trated. 

Compound (98) in trimethyl orthoformate is converted to 
(99) in the presence of acid, e.g., methanesulfonic acid at 
about 15-25** C, usually about 18-22° C. for about 20-120 
minutes. The reaction is cooled and quenched with a base, 
e.g., an oiganic base such as TEA. The reaction mixture is 
concentrated and purified, e.g., by flash column chromatog- 
raphy on silica gel. 

Compound (106) is prepared by reacting (99) in organic 
solution such as DMF, CHjClj or CHjCla/DMF (about 2:1 
v/v) with K2CO3 at about 15-25** C. for about 30 minutes, 
followed by adding phenyltrifluoromethanesulfonimide and 
stirring the mixture for about 10-24 hours, usually about 
12-16 hours. The reaction mixtxue is then diluted with 
CH2CI2, washed with water once or twice and concentrated 
and purified. 

Compound (96) is prepared by reacting 
Hte— C(R^°)yNH2 with ethyl trifluoroacetate at about 
15-25** C. for about 10-24 hours, washing with saturated 35 
aqueous NaHCOg and concentrated. Compound (96)' is 
purified by distillation. 

Compound (107) is prepared by stirring an organic sol- 
vent such as DMF containing about 2 equivalents of (96), 
about 2 equivalents of an organic base such as TEA, and 40 
Pd((PPh)3)4, Cul and about 1 equivalent of (106) at about 
15-25** C. for about 18-36 hours. The organic phase is 
washed with water, dried and purified by silica gel 
chromatography, to obtain (107). 

Compound (107) is hydrogenated in ethanol in the pres- 45 
ence of 10% Pd/C at room temperature. The catalyst is 
filtered off, the filtrate is concentrated and then treated with 
cone. NH^OHidioxane (1:1) to afford (108). The amino 
group in (108) was protected with CBZ to afford (109). 

Compound (110) is prepared by treating (109) in ethanol 50 
with aqueous HCl (about 3 N) at about 15-45** C, usually 
about 40** C, for about 30-120 minutes, usually about 60 
minutes. The product is dried and optionally azeotroped 
using e.g., CH3CN, several limes. 

Compoimd (111) is prepared by reaction of (100) with 55 
mesitylenesulfonyl chloride in the presence of a tertiary 
amine such as TEA. 

Compound (112) is prepared by stirring a mixture of (110) 
and (111) in organic solution containing about 2 equivalents 
of an organic base such as DBU or TEA at about 15-25" C. 
for about 10-24 hours. The reaction mixture is washed with 
an aqueous 10% citric acid solution, dried and purified by 
siHca gel chromatography. 

Compound (113) is prepared by treating (112) with satu- 
rated NH3 in methanol at about 15-25** C. for about 3-4 
days. The reaction mixture is dried, concentrated and puri- 
fied by silica gel chromatography. 



Compound (114) is prepared by hydrogenation of (113) in 
the presence of 10% Pd/C at about 15-25** C. for about 3-6 
hours. Catalyst is removed, washed, and the filtrate is 
concentrated to dryness. The amino group in (114) is pro- 
tected with FMOC to afford (115). 

Where in scheme B is an optionally protected 
monosaccharide such as 2'-deoxyribose, 2'-deoxy-2'-R^^- 
substituted ribose, 2'-deoxy-2-R^^ -substituted arabinose, 
ribose or arabinose, the sugar's hydroxyl groups in com- 
pounds (111) and (112) arc usually protected with a base- 
labile protecting group such as acetyl, propionyl and phe- 
noxyacetyl. These protecting groups are removed by 
Ueatment with base during synthesis of (113). 

Exemplary compounds where R^ is a monosaccharide 
have the strucmres (135)-(139), which correspond to com- 
pounds (111)-(115) respectively. For compounds (135)- 
(136), R^ at sugar hydroxyls is typically a base labile 
protecting group such as acetyl. R^^ is t>pically — O — . 
During conversion of (136) to (137), the R^ protecting group 
is removed ixom the sugar under ring closure reaction 
conditions, which facilitates ring closure. 

Structure (139) compounds may be converted to mono- 
mers suitable for oligonucleotide synthesis. Such monomers 
typically have a coupling group at the 3' position, e.g., 
H-phosphonate, or a phosphoramidite such as a 
P-cyanoethylphosphoramidite, N,N-dii50propyl-amino-p- 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. The 5' position will 
contain a DMT-0- or other protecting group suitable for 
oligonucleotide synthesis. The monomers may alternatively 
have a coupling group at the 5' position and a protecting 
group at the 3' position. The protecting and coupling groups 
are added sequentially. 

(135) 




R^O 

CBZNH(R5°)YCHiCH2 



(136) 
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CBZNH(R^°)yCH2CH2>. 
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HO^ V» 
FmocNH(R5'*)YCH2CH2N 
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Scheme C shows synthesis of structure (1) compounds 
where in is — NH— . 



Scheme C 
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In scheme C, R" is — (CHR^^)--<R^20— CHR^^— , 
where R*^ is —H or Cj-C^ alkyl (typically C1-C3), usually 
— H, and R^^ is a bond, — CHR^^— O— CHR^^— 
— CHR^^^— S— CHR^""^— ^, — CHR"^— NR^— 
CHR^^— C^-Cio alkylene (typically Cg-Cg) optionaUy 
substituted with 1 or 2 moieties selected from the group 
consisting of Cj-Cg alkyl (typically C1-C3 alkyl), — OR^, 
=0, — NO2, — N3, — CN, — COOR^ or — N(R^)2. In R^^ 
any hctcroatom in the spacer chain will be separated from 
the nitrogen atoms that R^~ is linked to by one methylene 
and one or more — CHR^^ — moieties. Typically, adjacent 
carbon atoms in R^^ are not substituted with — OR^, =0, 
_N02, — N3, — CN, — COOR^ or — N(R^)2, Protecting 
groups present on (119A) are usually stable to oxidizing 
conditions and labile to basic conditions. The protected 
intermediate (119A) is prepared by reacting phthalic anhy- 
dride with the appropriate HOCH-^^^— NHj moiety to 
yield a phthalimide compound or by reacting a phthalimide 
compound with HOCH— R^^— Br. 

Compound (117) is prepared by reacting (101) or (111) 
with about 1.0-1.5 equivalents, usually about 1.1 equiva- 
lents of (116) and about 1-2 equivalents, usually about 2 
equivalents of organic base such as DBU or TEA in an 
60 organic solvent such as CH2CI2/CCI4 (about 1:1) or CH2CI2 
by reaction at about 10-30* C, usually at about 18-25** C. 
for about 16-48 hours, usually about 20-28 hours. 

Where R^ in scheme C is an optionally protected 
monosaccharide such as 2*-deoxyribose, 2'-deoxy-2*-R^- 
65 substituted ribose, 2' -deoxy-2'-R^^ -substituted arabinose, 
ribose or arabinose, the sugar's hydroxyl groups in com- 
pounds (101), (111) and (117) are usually protected with a 
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base-labile protecting group such as acetyl, propionyl and 
phenoxyacetyl. Hiese protecting groups are removed by 
treatment with base during synthesis of (118). Compound 

(118) was hydrogenated in ethanol and acid to obtain (119). 
Compound (120) was obtained by reductive alkylation of 

(119) with aldehydes. 
Compound (120) is optionally converted, without remov- 
ing the nitrogen protecting group linked to the exocycHc 
amine, to a monomer suitable for use in oligonucleotide 
synthesis, e.g., (121) (not shown), by protecting the 5* 
hydroxyl group by reaction with a protecting group reagent 
such as DMT-G (4,4'-dimethoxytrityl chloride) to obtain the 
5*-protected derivative (not shown). Compound (121) is then 
converted to a derivative (121A) (not shown) suitable for 
coupling the 3' hydroxyl group with another 5* hydroxyl 
group, i.e., a coupling group such as H-phosphonate or a 
phosphoramidite, e.g., N,N-diisopropylamino-P- 
cyanoethoxyphosphine or N,N- 
diisopropylaminomethoxyphosphine, is linked to the 3' posi- 
tion. Exemplary compounds where is a monosaccharide 
have the strucmres (140)-(143), which correspond to com- 
poimds (117)-(120) respectively. For compounds (140)- 
(141), at sugar hydroxyls is typicaUy a base labile 25 
protecting group such a^ acetyl and R^'^ is typically — O — . 

In structure (143) compounds, the 5' or 3' oxygen, usually 
the 3' oxygen, is linked to — or a coupling group such as 
H-phosphonate, or a phosphoramidite such as 
P-cyanoethylphosphoramidite, N,N-diisopropylamino-P- 30 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. During conversion of 
compound (140) to (141), the R^ protecting group is 
removed from the sugar under ring closure reaction 
conditions, which facilitates ring closure. 
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Structure (143) compounds may be converted to mono- 
mers suitable for oligonucleotide synthesis. Such monomers 
40 typically have a coupling group at the 3' position, e.g., 
H-phosphonate, or a phosphoramidite such as a 
P-cyanoethylphosphoramidite, N,N-diisopropylamino-P- 
cyanoethoxyphosphine or N,N- 

diisopropylaminomethoxyphosphine. The 5' position will 
contain a DMT, MMT or other protecting group suitable for 
oligonucleotide synthesis. The monomers may alternatively 
have a coupling group at the 5' position and a protecting 
group at the 3' position. The protecting and coupling groups 
are added sequentially. During oligonucleotide synthesis, 
(143) is incorporated into an oligonucleotide such, as one of 
strucmre (2), by standard methods and the phthalamide 
protecting group, along with other base labile protecting 
groups at R^^ are removed using basic conditions, e.g., 
NH4OH or NH2CH3, to yield deprotected or partially depro- 
tected — NH— R^^— NH2 as R^ 

Straightforward variations of schemes A-C can be used to 
prepare other structure (1) compounds. For example, scheme 
D depicts a method to prepare structure (1) compounds 
where R^ comprises a cytosine derivative. In scheme D, iPr 
is isopropyl; R^^ is a portion of an R^ moiety having the 
60 strucmre — ^R^ — ^R*° — , where R^ is usually — O — , — S — , 
— NH— or — CH2— ; R^° is 

— NR^, — C(0)— , — CH2— O— CH2— , — CH2— 
NR^— CH2— or — CH2-S— CH2— ; Z3 is 1, 2 or 3, 
65 usually 1; Zl is 0 or 1, usually 0; and Z2 is 1, 2 or 3, usually 
1; any functional groups, e.g., — OH, — NH2, — COOH, 
— SH, that are optionally present at R^^ 
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are protected. Compound (124) is S'^'-diacetyl-C^-sulfonyl- 
2'-deoxyuridiiie, the synthesis of which is described in the 
examples. 



(125) 



Scheme D 




(aSi(CH(CH3)2)2 — 




AcO 




/ \ 
iPr Vr 



(125) 



10 



15 




Compound (126) of scheme D is converted to a monomer 
suitable for oligonucleotide synthesis using standard 
methods, e.g., treatment of (126) with a protecting group 

20 such as DMT-Q protects the 5' oxygen atom and yields 
compound (127) (not shown). One then links a coupUng 
group such as H-phosphonate or a phosphoramidite group 
such as NjN-diisopropylamino-P-cyanoethoxyphosphine or 
N,N-diisoprbpylaminomethoxyphosphine to Uie 3' hydroxyl 

25 group, of (127) to obtain (128) (not shown), which is 
suitable for oligonucleotide synthesis. Variation of the syn- 
thesis shown in scheme D is used to prepare compounds of 
strucnire (129) or (129A). 
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(129) 




(129A) 




Similarly, structure (1) compounds where is 
R* R^'to— ^ ^ , including 



-(CH^i-4NH' 
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continued 



r 



R**^ " NHR5, including 



(CH2)i-5NH NHR^ 
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(CH2),.3— R^ , 



are synthesized using variations of scheme D and the 
corresponding intermediates, e.g., reaction of (123) with 
2-habpyridine (Bematowicz "JOC" 57:2497 1992). 

Compounds of structure (1) where is 



10 



1/ \ 



,r60 r6. 



- y including 



H 



* 15 



20 



r62< 



(CH2),-3 R«- 



Compounds of structure (1) where is 



-NH ^R** , including 



N 
r61 



"NH(CH2)i-2 ^R^' 



where R^^ is — ^H, alkyl having 1, 2, 3 or 4 carfjon atoms or 
optionally protected substituted alkyl having 1, 2, 3, 4, 5 or 
6 carbon atoms including — CH3 and — CH2CH3, and R^ is 
— -O — or — S — are synthesized using scheme A, while 
scheme B is used when R^ is — CHj — and scheme C is used 
when R^ is — ISfH— . 



50 



Compounds of structure (1) where R^ is — R^ — R^ — 
(R3)2, including — R^— (CHJ,— N(R3)2, and R^ is —0— 55 
or — S — are synthesized using scheme A, while scheme B 
is used when R* is — CHj — and scheme C is used when R* 
is— NH— . 



Compounds of structure (1) where R^ is 




60 



65 



and R® is — 0 — or — S — are synthesized using scheme A, 
while scheme B is used when R^ is — CH2 — and scheme C 
is used when R^ is — ^NH — . Compounds containing these 
structures where R* is — S — or — O — , in general are 
synthesized using Scheme A and compound (103), e.g., 
using #1154-093 described in the examples below, and the 
corresponding protected alcohols. Compounds where R^ is 
— CH2 — are obtained by following Scheme B. Compounds 
where R^ is — NH — can be obtained using compound (119), 
e.g., using #1090-68 and protected aldehydes described in 
the examples below, by following Scheme C. 

Schemes £ and F illustrate the synthesis of alcohols 
containing an imidazole moiety (see^ e.g., Munk "J. Med. 
Chem.'' 40:18 1997). In scheme E, R^^ is R^ or alkyl having 
1, 2, 3, or 4 carbon atoms and R^^ is — OR^, —(CH^zfiR^ 
or — (CH2)i.2— R^MCH2)i.2— OR^ and Z4 is 0, 1, 2, 3, 4 
or 5. In scheme F, 2^5 is 1, 2, 3, 4 or 5, 



25 



where R^^ is — N— , — CH— or —C(CH^)^, R' 

>6 



is— H, 

— NH2 or — NH(CH3) and R^ is — O— or — S— are 
synthesized using scheme A, while scheme B is used when 30 
R^ is — CH2 — and scheme C is used when R* is — NH — . 



Scheme E 

BrC=N * R«N=C=NH 
H2NCH(CH2R*^CH(OCH2CH3)2 
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Scheme F 



N a 

N 
H 



CHjR^^ 



r5o(CH2)z4NH2+ BiC^N -2-^ r5o(CH2)z5N=C=NH 
H2N CH2CH(OCH2CH3)2 



r50(CH^Z5N=C=NH 



r5o(CH2)z5NH- 



o 



Scheme D is also used to obtain compounds of structure 
(1) where R^ comprises, for example, a moiety such as 
-(CH2)2-4.-NH((CH2)2.4NH)o.4-(CH2)2-4NH2, 

-NH2)2» or when R is 



IS H, 

33\ 



CH3, 

the other R 
(48), (49) 



CH2CH3 a 
is 



^CH2)2-4NH(CH2)2^N((CH2)n.4 

— R*^(CH2);sr(R^)2 and one R^ i 
protecting group or —(CH^^—NCR^\ and 
-<CH2).-N(R^% -CH(N[R^^]2)-N(R- 
or (50). 

Converting a compound of structure (4) to a compound of 
structure (1) is accomplished by a method comprising dis- 
placing R^'*. In this method, R^"* usually is — ^Br and R^^ is 
usually — CH— . 

R^ groups containing a diol, e.g. 
(CH20H)2— CH— (CH2)„-4>^ where n is 1^, are con- 
verted by cleavage to the aldehyde CHO — CH2-;<CH2)„ — 
0 — using sodium periodate. Reductive alkylation of the 
aldehyde to a primary or secondary amine is accomplished 
using N(R^)2 where R^ independently is — ^H, C^.^ alkyl, 
including — CH3, — CH2CH3 or a protecting group, usually 
—II or — CII3. 

To the extent that a compound within the claims' scope 
can not be directly synthesized using the schemes above or 
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the examples below, the artisan will employ straightforward different fluoropbor such as fluorescein or rhodamine 

methods known for preparing such compounds, see e.g., B. capable of energy transfer. The two oligonucleotides recog- 

M. Trost & I. Fleming, cds. "Comprehensive . Organic nize a complementary sequence in which the 3' end of the 

Synthesis", volumes 1-8, Pergamon Press; M. Fieser, ed. target sequence binds the oligonucleotide bearing the 

"Fieser and Fieser's Reagents for Organic Synthesis", vol- 5 3'-terminal fluorophor and the adjacent 5' sequence of the 

umes 1-17, John Wiley & Sons; and A. F Finch Ed, target binds to the oHgonucleotide bearing the 5' terminal 

"Theaheimer's Synthetic Methods of Organic Chemistry", Auorophor. Binding is a^ayed by measuring a change m 

volumes 1^9, latest editions, S. Karger AG. ^''''l'**?'?' f^""' °' the ohgonuc eotodes when 

they bmd m tandem accordmg to the illustrated model In 

Uses for the Compounds of this Invention jo ^^^^^ embodiments only a single labeled oHgonucleotide is 

employed in the hybridization method. The oligonucleotides 

The compounds of this invention find uses in the of this invention thus are useful in solution phase hybrid- 
diagnostic, analytic and therapeutic fields, or as intennedi- ization diagnostics, i.e., it is not necessary to perform a 
ates in the preparation of compounds useful in such fields. phase separation in order to detect labeled oligonucleotide 

The R^-substituted compounds of structure (1) are useful binding, 
as intermediates in the preparation of the labeled biopoly- Detecting a target base sequence in a nucleic acid or an 
mers of structure (1), wherein a biopolymer is rendered oligonucleotide using an invention oligonucleotide is 
fluorescent or otherwise detectably labeled by linkage to the accomplished by a method comprising (i) mixing or con- 
poly cyclic substructure. It is most convenient, however, to tacting a sample suspected of containing a nucleic acid with 
use the appropriate strucmre (1) compounds as monomers in an optionally labeled invention oligonucleotide comprising 
the preparation of nucleic acids or oligonucleotides. The at least about 7 bases, usually about 12-30 bases, where the 
labeled biopolymers are employed in diagnostic assays or protecting groups have been removed, (ii) allowing time 
preparative procedures in the same fashion as other suflicient for the invention oligonucleotide to bind to the 
fluorophor-labeled biopolymers, e.g. in fluorescence polar- target base sequence, (iii) separating unbound invention 
ization methods, fluorescence activated cell sorting, oligonucleotides firom bound invention oligonucleotides and 
competitive-type EMIT immunoassays and the like. (iii) detecting the presence, absence or amoimt of bound 

The linker- and hydrogen-substituted compounds of invention oligonucleotide. Aspects of the invention include 

strucmre (1) are useful as intermediates to prepare materials conducting any one of these steps individually, which are 

suitable for use in affinity purification ofguanine or guanine- each part of the complete method. The use of known 

containing compounds, e.g., nucleosides. The structure (1) 3Q hybridization methods and conditions are applied to accom- 

compounds form hydrogen bonds with guanine and one can phsh the method. 

thus link the strucmre (1) base strucmre to an appropriate In this method to detect target base sequences, one may 

. support material or polymer to prepare an affinity resin optionally use invention oligonucleotide and target base 

suitable for binding to guanine-containing compounds, e.g., sequences that are substantially complementary to each 

nucleosides, nucleotides and oligonucleotides or similar 3^ other, i.e., sequences having 1 or no mismatches per about 

compounds containing guanine analogs, e.g., every 15-30 bases. The target base sequences to be detected 

7-deazaguanine. Typical linker-derivatized structure (1) are generally present in a cell, in a cell or tissue extract or, 

compounds optionally contain a linker of structure usually, in a purified nucleic acid or oligonucleotide 

— (R'^^^ — ^R"*^, defined above in the discussion under preparation, e.g., a sequence encoding a portion of a 

Scheme A. 40 cytokine, cell surface molecule, an enzjnne such as famesyl 

The monomers are of particular use in preparing oligo- protein transferase or an oncogene such as neu, myc, raf, ras 

nucleotides for diagbostic or therapeutic use. Since oligo- or c-ras. The target base sequences to be detected are 

nucleotides having 2 or more nucleotides or nucleotide generally present in RNA or single stranded DNA, although 

analogs bearing the polycyclic substmctore will usually the invention oligonucleotides are also useful to detect base 

exhibit greatly increased Tm, such oligonucleotides are 45 sequences in duplex nucleic acids, 

particularly useful in therapeutic or diagnostic utilities Another invention embodiment is a method comprising 

where hi^ly stable duplex hybridization structures are incubating a cell with a deprotected invention oligonucle- 

desired. Since these oligonucleotides frequently are otide containing at least about 7 bases, usually about 12-30 

fluorescent, changes in the oligonucleotide fluorescence can bases, wherein the invention oligonucleotide is optionally 

be followed as the oligonucleotide binds to complementary 50 present in a transfection complex comprising the invention 

nucleic acid or oligonucleotide sequences. These changes oligonucleotide and a permeation enhancing agent. This 

are detectable as modifications in energy transfer, e.g., method is used to introduce optionally labeled invention 

fluorescence quenching or shifts in activation or emission oligonucleotides into cell cytoplasm or vacuoles. 

wavelength(s). One optionally conducts this method using invention 

The polycychc substructure labeled oligonucleotides are 55 oligonucleotides having a octanolrwater partition coefBcient 

employed in diagnostic or analytic methods in the same of about -0.5 to about 2,5, typically about 0,0 to about 2.0, 

fashion as other labeled ohgonucleotides. For example, the usually about 0.5-1.5, and a solubility in water of at least 

oligonucleotides are used in hybridization methods in which 0-001 fig/mL. However, in these embodiments, no perme- 

an antibody capable of binding base-paired structure (1) is ation enhancing agent is usually needed to introduce the 

used to detect binding of the oligonucleotide to a target 60 compound into the cells. 

nucleic acid sequence. In addition, changes in fluorescent One can use oligonucleotides containing 1, 2, 3 or more 

character can be assayed as described above. Typically, 2, 3 invention base(s) to detect a base pair mismatch in a nucleic 

or more polycydic si^structure labeled oligonucleotides are acid sample using ribonuclease protection assay methods 

used in a hybridization assay. One oligonucleotide is labeled described in U.S. Pat. No. 5,589^29. Thus, one can use the 

at its 3' end with a polycyclic substnicmre containing 65 invention compounds as described in step (a) or step (b) or 

nucleotide while the other nucleotide is labeled at its 5* end step (c) (or, in sequence, steps a, b, or steps b, c or steps a, 

with the same or another polycyclic substructure or with a b, c) of claim 1 of U.S. Pat. No. 5,589^29 to practice that 
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claimed method (or to practice necessary steps in the claim labeled oligonucleotides containing 1, 2, 3 or more invention 

1 method, e.g., contacting an RNA probe containing an bases, see e.g., U.S. Pat. Nos. 4,910300, 5,093,232, 5,124, 

invention base with a singje stranded nucleic acid to form a 246, 5,202,231, 5,258,506, 5,202,231, 5,525,464, 5,578.717, 

duplex). One can similarly use oligonucleotides containing 5,578,715, 5,578,467, 5,591,584, 5,599,932, 5,599,668, 

an invention base(s) to screen mammalian genomic DNA 5 5,593,841, 5,578,458, 5,589^32, 5,589^33, 5,589,339, 

samples for insertions, deletions or substitutions using 5,589,342, 5,593,830, 5,593,831, 5,593,832, 5,593,836, 

screening assay methods described in U.S. Pat. No. 5,589, 5,593,840, 5,593,841, 5,593,863, 5,604,097, 5,604,099, 

330. Thus, one can use the invention compounds as 5,605,793, 5,605,794, 5,605,796, 5,605,798, 5,605,824, 

described in step (ii) or step (iii) or step (iv) or step (v) (or, 5,606,047, 5,608,063, 5,614,617, 5,594,117, 5,633^64, 

in sequence, steps i, ii, or steps i ii,m or steps i 5,639,612, 5,639,611, 5,639,608, 5,639,647, 5,639,736, 

1 11, m IV, etc ) of claim 1 of 5,589330 to practice that 5,639,626, 5,641,631, International PubUcation Nos. WO 

clauned method (or to pracUce necessary steps in the claun 97/07246, WO 97/06252, WO 97/06183, WO 97/04787, 

1 method, e.g.,contactmg an ohgonucleou^ 97/05280, WO 96/41017, WO 96/41012, WO 

invention base with an immobdized genomic DNA sample ' ' 

to form a triplex or duplex). One can similarly use ohgo- , x^r^ f^^.r^^^^^ x^?^ f^^,r^^net^ 

nucleotides containing an invention base(s) to design a 15 WO 96/41016, WO 97/04126, \\^0 97/^ 

synthesis method for an array of materials to be synthesized European Publication No. 761 822. For each of these 

on a substrate using methods described in U.S. Pat. No. appUcaUons, one would use an invention ohgonucleoUde m 

5,593,839. Thus, one can use the invention compounds as placc of one or more of the described oHgonuclcotidcs. To 

described in the first part of the second step or the second practice these and other typical applications, one will typi- 

part of the second step or the third part of the second step or 20 cally use an invention oligonucleotide in one or more of the 

the fourth part of the second step or (or, in sequence, in the steps needed to practice the methods described in these 

first step and in the first part of the second step, etc.) of claim publications. In many of these applications, one will use an 

1 of 5,593,839 to practice that claimed method (or to invention oligonucleotide containing (i) about 7-50, usually 

practice necessary steps in the claim 1 method). about 8-30 linked monomers, usually where the oligonucle- 

One can also use the invention monomer compositions 25 otide has a uniform polarity, (ii) 1, 2 or 3 invention bases, 

containing a 5'a-^^S-thiotriphosphate group or a 5' triphos- (iii) purine and pyrimidine bases having a base sequence 

phate group linked to 2',3'-dideoxyribose to perform dideoxy complementary or substantially complementary to a target 

DNA sequencing methods. One may use invention mono- sequence, i.e., a defined base sequence having no more than 

mers in kits that optionally contain buffers or enzymes about 1, 2, 3 or, for relatively long oligonucleotides (about 

suitable for DNA sequencing. The invention monomers may 30 35-50-mers), 4 base mismatches, and, optionally, (iv) other 

be advantageously used in enzymatic DNA sequencing moieties or features, which are readily apparent to the skilled 

protocols because the invention monomers, which act as artisan who has read this disclosure, that facilitate using the 

cytosinc surrogates, have a high affinity for guanosinc and oligonucleotide in the intended application, e.g., (i) a free 

may perform better than cytidine 5' triphosphate in sequenc- hydroxyl group at the 3' terminus for applications where the 

ing reactions, particularly where the DNA to be sequenced 35 one wishes to use the invention oligonucleotide as a primer 

contains a high proportion of guanosine residues, which can in enzyme-mediated chairi elongation applications, (ii) a 

cause sequencing problems. fluorescent label, enzyme label or radiolabel to facilitate 

Oligonucleotide analogs containing 1, 2, 3 or more inven- oligonucleotide detection in a particular assay, (iii) biotin 

tion bases are also suitable for binding to open complexes in linked to a convenient location such as the 5' or 3' terminus, 

cells or cell lysates of eukaryotic or prokaryotic ceUs. Open 4n or (iv) a free 5* hydroxyl group for enzymatic phosphory- 

complexes may also arise in systems comprising at least lation. 

partially purified cell components, e.g., RNA polymerase, When one adapts the presently claimed compounds to 
nucleotide triphosphates, suitable transcription cofactors, uses known in the art, one wiU use known hybridization 
DNA binding proteins and duplex DNA capable of tran- conditions, enzyme (such as polymerase, RNase or DNase) 
scriplion. Open complexes are regions of single stranded 45 reaction conditions or detection technologies to design the 
DNA or RNA that occur at least transiently during duplex invention oligonucleotide in a way that does not interfere 
nucleic acid metabolism, e.g., during DNA replication or with the intended use, or in a way that improves the intended 
RNA transcription in the nucleus, cytoplasm, plastids or use. For example, when a previously described assay or 
mitochondria. Such DNA replication or RNA transcription diagnostic method calls for conducting an oligonucleotide 
may involve metabolism of cellular, viral or other nucleic 50 hybridization or enzyme amplification protocol at a specified 
acids. One can use binding of invention oligonucleotides to temperature, one will have the option of increasing the 
single stranded open complex sequences to affect nucleic hybridization or enzyme amplification temperature due to 
acid metabolism, e.g., one may inhibit RNA transcription or the presence of an invention base(s) in the oligonucleotide, 
one may use the oligonucleotides, which are optionally Thus, one can use the enhanced binding affinity or enhanced 
labeled, to detect the presence of open complexes. Such 55 binding specificity property of the invention oligonucle- 
oligonucleotides will typically comprise about 8 to 30 otides to increase hybridization stringency. Similarly, when 
monomers, usually about 12-21 monomers, having a one intends to use an invention oligonucleotide in a poly- 
sequence complementary to a single stranded open complex merase chain reaction (PGR) amplification method, one 
region(s) involved in the initiation of a nucleic acid meta- would initially test to see if the presence of an invention base 
bolic event, e.g., initiation of RNA transcription in a pro- 60 at the 3* terminus improves the oligonucleotide's primer 
moter or transcription initiation region. Workers have function and then adjust the reaction conditions accordingly, 
described open complexes and their formation in vitro and e.g., by altering the primer to target sequence ratio, primer 
in cells, e.g., Biuns, "Biochem. J." 317:305-11 1996, Smith, concentration or by altering the temperature at which one 
"Proc. Natl. Acad. Sci. USA" 93:886^-72 1996, Jiang, "J denatures and amplifies the PGR reaction. If the presence of 
Biol Chem" 268:6535-40 1993. 65 an invention base significantly affects polymerase -mediated 
Workers have described other applications that one can primer elongation, then one could choose to design inven- 
practice in a similar straightforward manner using optionally tion oligonucleotides for this use without an invention 
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base(s) at the 3* terminal 1, 2 or 3 monomer positions. invention polycyclic bases, usually 2, 3 or 4, which can be 

Skilled artisans routinely design diagnostic or assay proto- the same or different polycycles but generally are the same 

cols by testing varying temperature, salt composition and inveniionpolycycle.They also optionally are copolymerized 

concentration, pH, oligonucleotide concentration, enzyme purine or pyrimidine bases containing known alkynyl 

concentration, enzyme reaction buffer, net oligonucleotide 5 substitutions (e.g., U.S. Pat. Nos. 5,645,985 and 5,594,121), 

charge, or oligonucleotide base, sugar or linkage structure ^ particular pyrimidine bases substituted at the 5 position 

during assay development. ^ carbon atom which is bonded to another atom by a Pi 

. tj 'uj bond, or the fluorescent cytosine derivatives of Inoue et al. 

Embodiments mclude a method compnsmg preparmg a ^ ^^^^ ^ 

series of oligonucleotides, each having the same base j r *u- • ^ *u i- i 

r- . • . ' ^ ' tn The compounds of this invention, or other ohsonucle- 

sequence, which sequence contains 2 or more cytosme 10 ^^"^ ^"Jf:"^r wi. uuo m *^uuuu, ui umsi uuguuuw^ 

. ^ . u u 4: *u * • otides capable of forming high meltmg duplexes (e.g. the Pi 

bases, where each member of the senes contams an mven- ujju j- jtf\ iii- - j 

t;/^« K«c^ ..f ir, «io^^ r.f r.^ bouded bases discussed above), are useful in unproved 

tion base 01 structure (3) m place ot one or more or the j r 1 i_ • i- 

, . . , c u 1- 1 » * 11 • methods for polymerase chain reaction ("PGR ) or hgase 

cytosme residues. Such oligonucleotides typically comprise , . » • /«t t n j j . • r 

n\ w * £i ^ ' \i r\ 1* 1 cham reaction ( LCR ) amplification and detection of 

2 to about 6 cvtosme residues. One prepares ohgonucle- . . ^ u * *u u- u i- 

, * • • ' • *• u * u * ■ i< nucleic acids. In one embodiment, the high meltmg oligo- 

otides contammg an mvention base at each cytosine position 15 . i_t- -. irS^n 

J „ * I ^.j ^f, nucleotides are used as one or both pnmers m classical PGR 

and optionally one prepares oligonucleotides contaimng an . . Particularlv in PGR nmce««e<; the 

invention base at each of two cytosine positions, at each of as probes m LCR. Particularly in PGR processes the 

^ . J f elevated meltmg temperature or duplexes with high melting 

three cytosine positions and so forth. One uses this method . • j *i_ j . n 1 

♦ ^ » • u- u 1' 1 ♦•J / \ A * pnmers avoids the need to thermally cycle the reaction 

to determine which oligonucleotide(s), compared to a con- i , ^ //.^o. r.^-© ^ 

^ 1 ■ . . L / \ C *• 1 because at these elevated temperatures (about 68 to 95 C, 

trol contammg no mvenUoo base(s), has optimal properties 2U ■ ^ i_ .^eo ^ j • - 

^ J • J 1- »• u u -J- *• 4= preferably greater than about 75 C; the denvative pnmer 

for a desired apphcation, e.g., hybndization afi&mty for use .„ * * 1 . *i. 

t • u u *- . . will continue m at least some proportion to anneal to the 

as a probe or optimal antisense activity for inhibiting target . * u * * • j. * -if * /^ j* -n 

^ • • « "& & target but extension product will not. Ordinary pnmers will 

gene expression m a cell. fu u j *u 1 •« * 

not hybridize and the polymerase will not initiate transcrip- 

Structure (5) monomers, when triphosphorylated and con- 1^0^ ^ntil the reaction mixture is cooled to a level at which 

taining R^ ribosc or deoxyribosc derivatives that are chain primer wiU anneal to the target sequence (usuaUy, about 

tenninating (e.g. where the 3' position is not hydroxyl), are 550 ^ y jj^^ elevated temperature that is chosen for use with 

useful in methods for fluorescent chain-terminatmg the high-melting derivative oHgonucleoUdes (a temperature 

dideoxynucleotide sequencmg m the same general fashion suitable for all of annealing, extension and melting) is one • 

as ddNTPs having other linker-attached fluorophores. ^ which a substantial proportion of the extended primer 

Since oligonucleotide compounds such as those of struc- population (about 10 to 90 mole %) is found dissociated 

ture (2) are capable of participating in Watson-Crick base from the taiget, but sufficient unextended primer is bound to 

pairing they will hind to nucleic acids and therefore are permit extension. Optimally, this is about from 85 to 95** C., 

useful in detecting the presence of nucleic acids. Bases of ordinarily 92 to 95** C. Alternatively, the optimal tempera- 

strucmre (1) in such oligonucleotides will recognize gua- ture is determined empirically by simply selecting a range of 

nosine as its complementary base in natural nucleic acids. temperatures within the melting range of the extended 

Invention oligonucleotides, including many structure (2), sequence, but within the annealing range of the derivative 
(2A), (2B) and (2C) oligonucleotides capable of forming primers, and measuring the amount of amplification product 
high melting duplexes with complementary sequences, are to achieve satisfactory levels for the diagnostic or prepara- 
useful in numerous applications, including antisense or ^ tive processes at hand. Amplification methods have been 
codeblocking utilities in vivo or in vitro as well as diagnos- described, e.g., U.S. Pat. No. 5,667,974. 
tics and probe uses. High melting duplexes are those having An exemplary method to use an invention oligonucleotide 
melting temperatures substantially above the melting tem- to amplify a nucleic acid base sequence comprises, provid- 
peratures of oligonucleotide or nucleic acid duplexes of the ing an invention oligonucleotide and target nucleic acid 
same sequence that contain the ordinary, naturally occurring 45 sequence that forms a complex having a Tm of about 85 to 
bases, e.g., adenosine, cytidine, uridine, guanosine, thymi- 95° C, optionally heating the complex to about 85 to 95° G. 
dine and the like. "Substantially above" means that the (e.g., to a temperature within about 5° G. of the Tm) to 
derivative oligonucleotide, when hybridized with its provide a heated complex, and optionally mixing the heated 
complementary sequence, will not dissociate from the complex with a DNApolymerase such as Taq polymerase or 
duplex until the temperature is raised from about 2 to 40** C., 50 other suitable heat stable enzyme. In this method, the 
ordinarily about 8 to 40° C., above the dissociation tem- complex and the heated complex is typically a duplex. The 
perature of the same oligonucleotide having the analogous polymerase reaction will contain cofactors and buffer con- 
normal A, C, U, G or T bases, but to no greater temperature ditions suitable for amplification purposes, 
than about 95** C. This is known as the D Tm. Ordinarily, D It will be understood that the optimal temperature will 
Tm is measured by comparing control oligonucleotide bind- 55 vary considerably depending upon the derivative bases 
ing to complementary RNA or DNA with the binding of test chosen, whether they are adjacent or separated by other 
oHgonucleotide to the same RNA or DNA, foUowing, e.g., bases, the number of bases in the primers (the highest' 
the method described in Jones etal., "JOG" 58:2983 (1993). annealing temperatures are found with primers having 

Some of the invention riboside and deoxyriboside com- greater than about 18 bases or base analogs), the proportions 

poimds are fluorescent. The compounds remain fluorescent 60 of pyrimidines and purines and the like. The heat stable 

upon incorporation into oligonucleotides and are visible polymerase useful in this system is for example Taq poly- 

intracellularly, including when bound to target sequences merase or other suitable heat stable enzyme. Thus, whatever 

after direct injection or after Iransfectioninlo.cells in accord the optimum temperature chosen, the amplification and 

with known methods. priming reactions are conducted conventionally but at a 

One can optionally prepare oligonucleotides having tan- 65 substantially constant temperature, 

dem arrangements of the novel bases. In general, such Not only do the oUgonucleotides of this invention facili- 

tandem arrangements will contain firom 2 to about 10 tate PGR or LGR processes, the fluorescent properties of the 
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primers also facilitate detection of the extension products. 
The extension products are readily separated from the unex- 
tcndcd primers, e.g. on the basis of molecular weight, and 
detected by their fluorescence, thereby avoiding staining 
with agents such as ethidium bromide. In some 5 
embodiments, the fluorescence is enhanced by using NTP's 
comprising the fluorescent substructures of this invention in 
primer extension so that the fluorescent NTPs are incorpo- 
rated into the extension products as weU. The polycyclic 
substructure used in the NTP's may be the same or different lO 
than the one incorporated into the primers. 

We incorporate herein all citations by reference with 
specificity. 

The following examples further illustrate and do not limit 
the invention. 

EXAMPLE 1 

The following example shows synthesis of representative 
starting materials and intermediates for making invention 20 
compounds. 



NO2 




OH 
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1019-97 



^v^S,^^CH2CH2CH2NITrFA 



1019-102 



XX 



1090-05A 



NH2 



HO^^^^s,^^^CH2CH2CH2NHCB2 

XT 



1090-05 



CXTf 



1019-46 



35 Conditions: a: Hj, 10% Pd/C, CH3OH, RT; b: HC(0CH3)3, 
methanesulfonic acid, 47.8%; c: Tf^NPh, CH^Cl^/DMP, 
K2CO3, RT, quantitative; d: HC^CCH^NHTFA, PdCPhgP) 
4, Cul, TEA, DMF, 60%; e: H2, 10% Pd/C, CHgCOOEt; f: 

^ con. NH40H/Dioxane (1/1), RT; g: PhCH20C(0)Cl, pyri- 
dine; h: 3N HCl/EtOH (1/1), RT, 1 hr or 40° C, 30 min. 




OAc 



1090-15 



6,028,183 



55 

-coatinued 




OAc 

1U9U-18 



(CH2)3NHCBZ 




H2 (in a balloon), 
10%Pd/C, 
CH3QH, 68% 



OH 

1090-22 




OH 

1090-25 




FA, pyridine, 
CHaCla 



OH 

1090-26 
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DMTO' 




OPO2HTEAH* 

1090-31 



9-{3'-Aminopropyl)phenoxa2ine 

2-Aminoresorcmol (#1019-43): A ethanol solution (500 
mL) of 2-mtroresorinol (20 g; 0.129 mmole) was hydroge- 
nated (H^ in balloon) in the presence of 10% Pd/C (1 g) at 
room temperature overnight. The catalyst was filtered off 
through a celite pad. The filtrate was concentrated and 
purified by flash column chromatography, affording 15. 5g, 
96%of 2-aminoresorcinol. ^11 NMR (DMSO-dg): 8 8.80 (bs, 
2H), 6.15-6.30 (m, 3H), 3.85 (bs, 2H), 

Compound #1019-45: A trimethyl orthoformatc solution 
(100 mL) of 2-aminoresorcinol (6.0 g; 48 mmole) was 
treated with methanesulfonic acid (4.6 g; 48 mmole). After 
30 min stirring at room temperature, the reaction was cooled 
to 0** C. and was quenched with TEA (4.8 g; 48 mmole). The 
reaction mixture was concentrated and purified by column 
chromatography on silica gel, yielding 3.10 g of product, 
47.8%. NMR (DMSO-dg): b 10.3 (bs, IH), 8.53 (s, IH), 

7.18 (dd, IH, J08.O Hz, J=7.8 Hz), 7.12 (d, IH, J=8.0 Hz), 
6.73 (d, IH, J=7.8 Hz). 

Compound #1019-46: A CH^Cls/DMF (N,N- 
dimethylformamide) solution (20 mL/10 mL) of #1019-45 
(2.90 g; 21.4 mmole) was stirred with solid K^COg (14.8 g; 
107 mmole) at room temperature for 30 min, followed by 
addition of N-phenyltrifluoromethanesulfonimide (8.50 g; 
23.6 mmole). The resulting mixture was stirred at room 
temperature overnight, then diluted with CHjCl^, washed 
with water twice, dried, concentrated, and piu-ified, affording 
5.76 g, quantitatively, of # 1019-46. ^H NMR (CDCI3): 8 

8.19 (s, IH), 7.66 (d, IH, J»8.2 Hz), 7.48 (t, IH, J»8.2 Hz), 
7.33 (d, IH, J-8.2 Hz). 

N-TFA-propargylamine (#1019-44): Propargylamine (25 
g; 0,45 mmole) was dissolved in CH3OH (500 mL), fol- 
lowed by addition of ethyl trifluoroacetate (84 g; 0.59 mole). 
The resulting mixture was stirred at- room temperature 
overnight, then concentrated to dryness. The residue was 
redissolved in CHjClj (200 mL), washed with saturated 
NaHCOg aqueous solution. The organic phase was isolated, 
dried, concentrated to a brown residue (liquid). The product 
#1019-44, 54.0 g was distilled off, 78.8%. ^H NMR 
(CDQa): b 6.60 (bs, IH), 4.18 (m, 2H), 2.38 (s, IH). 

Compound #1019-97: A DMF solution (12 mL) of com- 
pound #1019-46 (4.9 g; 18.3 mmole), N-TFA- 
propargylamine (5.5 g; 36.7 mmole), Pd(PPh3)4 (4.3 g; 3.7 
mmole), Cul (1.78 g; 9.3 mmole) and TEA (3.7 g; 36.7 
mmole) was stirred at room temperature for 24 hours. The 
reaction mixture was diluted with CHjClj (100 mL), stirred 
with Dowex 1x800 (HCO3 — form). The organic phase was 
washed with water, dried, and purified, yielding 2.94 g, 60%, 
of product #1019-97. ^H NMR (CDC\^): b 8.18 (s, IH), 7.62 
(d, IH, J=9 Hz), 7.48 (d, IH, J=9 Hz), 7.36 (t, IH, J=9Hz), 
6.85 (bs, IH), 4.51 (d, 2H, J-3 Hz). 



Compound #1019-102: Aethylacetate solution (60 mL) of 

2Q #1019-97 (1.7 g; 6.3 mmole) was hydrogenated in the 
presence of 10% Pd/C (200 mg) at room temperature. The 
catalyst was filtered off. The filtrate was concentrated to 
dryness, used for next reaction without further purification. 
'H NMR (CDCI3): b 8.25 (bs, IH), 8.12 (s, IH), 7.49 (d, IH, 
J-8.2 Hz), 7.36 (dd, IH, J=7.6 & 8.1 Hz), 7.20 (d, IH, J=7.5 

25 Hz), 3.25 (q, 2H), 3.07 (t, 211, J-6.6 Hz), 2.01 (p, 211). 
Compound #1090-05A: A 1.4-dioxane solution (10 mL) 
of compound #1019-102 (1.0 g; 3.6 mmole) was treated with . 
concentrated NHOH (15 mL) at room temperature for 16 
hrs. The reaction mixture was concentrated to dryness. The 

30 residue was redissolved in CH2CI2 (30 mL), containing TEA 
(0.74 g; 7.35 mmole), cooled to 0* C, followed by addition 
of benzyl cbloroformate (0.75 g, 4.4 mmole). The resulting 
solution was stirred at room temperature for 4 hrs, washed 
with HjO, dried and purified by flash column chromatog- 

35 raphy to give 0.81 g, 71% of #1090-05A. 

Compound #1090-05: A ethanol solution (10 mL) of 
compound #1090-05 A (0.81 g, 2.6 mmole) was treated with 
3 NHCl aqueous solution (10 mL) at 40° C. for 1 hr. The 
reaction mixture was concentrated to dryness, azeotroped 

40 with CH3CN three times. The crude product was used for 
next reaction, without further purification. ^H NMR 
(CDQs): 6 7.25-7.40 (m, 5H), 7.20 (t, IH), 6.78-6.90 (m, 
2H), 5.07 (s, 2H), 3.17 (t, 2H, J=6.8 Hz), 2.67 (t, 2H, J=7.6 
Hz), 1.79 (p, 2H). 

45 Compound #1090-18: Compound #1090-15 has been 
described (Lin, "JACS" 117:3873^3874, 1995). A mixture 
of Compound #1090-15 (1.5 g; 2.6 mmole), #1090-05 (2.6 
mmole) and DBU (0.8 g; 5.2 mmole) in CH2CI2 (30 mL) 
was stirred at room temperature overnight. The reaction 

50 mixture was washed with 10% citric acid aqueoiis solution, 
dried and purified on silica gel column chromatography, 
affording 1.21 g, 69% of product. ^H NMR (CDCI3): b 8.86 
(s, IH), 7.99 (s, IH), 7.29-7.40 (m, 5H), 7.15 (t, IH), 6,98 
(d, IH, J=8 Hz), 7.78 (d, IH, J«7 Hz), 6.27 (t, IH), 

55 5.20-5.23 (m, IH), 5.09 (s, 2H), 4.78 (bs, IH), 4.30-^.52 
(m, 3H), 3.20-3.30 (q, 2H), 2.65-2.80 (t+m, 3H), 2.10-2.20 
(2 s+m, 7H), 1.80 (p, 2H). 

Compound #1090-22: Compound #1019-18 (1.20 g; 1.78 
mmole) was treated with saturated NH3 in CH3OH (200 mL) 

60 at room temperature for 5 days. The reaction mixture was 
concentrated to dryness and purified by flash column chro- 
matography affording 0.63 g of product #1090-22, 70%. ^H 
NMR (CDCl3+10% CD3OD): b 7.30-7.38 (m, 5H), 6.81 (t, 
IH, J=7.3 Hz), 6.73 (d, IH, J=7.9 Hz), 6.60 (d, IH), 6,20 (t, 

65 IH), 5.10 (s, 2H), 4.36-4.40 (m, IH), 3.94-4.0 (m, 11^, 
3.70-3.90 (m, 2H), 3.23 (t, 2H), 2.58 (t, 2H), 2.30-2.40 (m, 
IH), 2.10-2.24 (m, IH), 1.70-1.82 (m, 2H). 
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yielding 
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AcO 




1. PhjP, CH2C12- 
reflux, 3 hr 



-ecu. 



2. 1.1 eq. 2-aimjio-3- 
nitro-phenol, 1.S eq. 
DBU,241ir 



OAc 



02N>, 




10 



Compound #1090-25: Compound #1090-22 (0.6 g) was 
dissolved in ethanol (10 mL) and was hydrogenated (H2 in 
a balloon) in the presence of 10% Pd/C (50 mg) at room 
temperature for 4 hrs. The catalyst was filtered off, washed 
with CH3OH. The filtrate was concentrated and dried to 
afford 0.3 g, 68% of product. NMR (CD3OD): 6 7.38 (s, 
IH), 6,7-6.82 (m, 2H), 6.57 (d, IH, Jo6.3 Hz), 6.23 (dd, IH, 
J-6.5 Hz, J«6.7 Hz), 4.35-4.38 (m, IH), 3.80-3.90 (m, IH), 
3.70-3.80 (m, IH), 2.60-2.80 (m, 4H), 2.10-2.30 (m, 2H), 
1.75-1.86 (m, 2H). 

Compound #1090-26: Compound #1090-25 (0.13 g, 0,34 
mmole) was dissolved in DMF/CHjCl^ (1 mL/3 mL), fol- 
lowed by addition of 9-fluorenylmethyl-N- 
succinimidylcarbonate (FMOC-NHS, 0.14 g; 0.41 mmole). 
After 1 hr of stirring at room temperature, to the reaction 15 
mixture pyridine (140 mg, 1.7 mmole), and DMT-Cl (4,4'- 
dimethoxytrityl chloride; 0.14 g 0.41 mmole) were added. 
The resulting mixUire was stirred at room temperature for 2 
hrs and then diluted with CHjClj, washed with water twice. 
The organic phase was isolated, dried and purified by flash 20 
column chromatography on silica gel, to give 153 mg, 49%, 
FAB HRMS (high resolution mass spectroscopy) calculated 
for M+H^ 899.416, found 899.366. 

Compound #1090-31: Compound #1090-26 (150 mg; 
0.167 mmole) in CH2CI2 (2 mL) was added to a 0"" C. cold 
CH2CI2 solution (1 mL) of PA (0.25 mL of IM CH^Clj 
solution, 0.25 mmole) and pyridine (66 mg, 0.83 mmole). 
The resulting mixture was then gradually warmed to room 
temperature. After 30 minutes, the reaction mixtiu% was 
diluted with CH2CI2, washed with 1 M TEAB aqueous 30 
solution, dried and purified on silica gel, eluted with 5% 
CH3OH/CH2CI2, then 15%, HjO in CH3CN. The combined 
fractions of product were concentrated and then partitioned 
between CH2CI2 and IM TEAB aqueous solution dried. 



25 



180 mg of H-phosphonate derivative, quantita- 35 



40 



45 



50 
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Cbmpound #1090-68: A CH^Q^— CCl^ (40 mL/40 mL) 
solution of 5-bromo-3'-5'-diacetyl-2'-deoxyuridine (2.0 g, 
5.1 mmole) and triphenyl phosphine (2.0 g, 7.6 mmole) was 
heated at reflux for 3 hrs. The reaction mixture was cooled 
to room temperature, followed by addition of 2-amino-3- 
nitrophenol (0.79 g, 5.1 mmole) and DBU (1.6 g, 10 
mmole). After stirring at room temperature overnight, the 
reaction mixture was washed with 10% citric acid aqueous 
solution, dried, concentrated and purified on silica gel. The 
isolated product contained Ph3P=0, was treated with satu- 
rated NH3 in CH^^OH for 5 days at room, temperature. After 
removal of all solvent, the residue (crude #1090-51) was 
redissolved in CH3OH (100 mL), and hydrogenated with Hj 
in a balloon in the presence of 10% Pd/C and 4 N HCl in 
dioxane (200 mL, 0.8 mmole). After 5 hrs, the catalyst was 
filtered off, washed with CH3OH. The filtrate was concen- 
trated and purified by flash column chromatography to 
afford 0.83 g of #1090-68, in 50% yield for 3 steps. NMR 
(DMSO-Clg): 8 9.6 (b, IH), 6.56 (t, IH, J=8.0 Hz), 6.22 (d, 



25 



30 



35 



IH, J«7.8 Hz), 6.10 (t, IH, J=6.8 Hz), 5.96 (d, IH, J=7.1 
Hz), 5.17 (d, IH), 4,92-5.08 (m, 3H), 4.16-4.20 (m, IH), 
3.70-3.80 (m, IH), 350-3.60 (m, 2H), 2.0 (m, 2H). 

Phthalimidoacctaldehyde #1090-70A: A mixture of N-(2- 
hydroxyethyl)phthalimide (90 mg; 0.47 mmole), DCC (0.15 
g, 0.7 mmole), DMSO (1 mL) and dicbloroacetic acid (20 
mL) was stirred at room temperature for 2 hrs. The reaction 
mixture was diluted vdih CH2CI2, washed with H^O twice, 
dried and concentrated. The crude product #1090-70A con- 
tained DCC-urea by-product, was used for the next reaction 
without further piuification. 

Compound #1090-70: A DMF/CH3OH (0.5 mL/2 mL) 
solution of #1090-68 (0.2 g, 0.42 mmole), phthalimidoac- 
etaldehyde #1090-70 A (0.47 mmole) in CH3OH (2 mL) and 
acetic acid (85 mg, 1.4 mmole) was reacted with sodium 
cyanoborohydride (87 mg, 1.4 mmole) at room temperature 
for 4 hrs. The reaction mixture was concentrated, and 
piuified on silica gel, yielding 243 mg, 78%, yield of 
compound #1154-70. 'H NMR (000^+10% DMSO-d^: 6 
9.45 (s, IH), 7.60-7.68 (m, 2H), 7.50-7.58 (m, 2H), 6.51 (t, 
IH, J-8.2 Hz), 6.12 (d, IH, J-8.2 Hz), 6.06 (t, IH, J«6.7 
IIz), 5.78 (d, III, J=8.0nz),4.94 (t, HI), 4.49-4.52 (m, HI), 
4.18-4.25 (m, 2H), 3.50-3.72 (m, 4H), 3.19-3.30 (m, 2H), 
1.85-2.05 (m, 2H). 

Compound #1090-89: Compound #1090-70 (35 mg; 69 
mmole) was dissolved in CH2CI2 (2 mL) and pyridine (0.5 
mL), then reacted with DMT-Cl (28 mg, 83 mmole) at room 
temperature for 3 hrs. The reaction mixture was worked up 
and purified by flash column chromatography, yielding 44 
mg, 78.6% of corresponding 5'-0-DMT-derivatives. ^H 
NMR (CDCI3): 8 7.80-7.90 (m, 2H), 7.68-7.75 (m, 2H), 
7.18-7.50 (m, lOH), 7.0 (b, IH), 6.80-6.90 (m, 4H), 6.77 (t, 
IH), 6.30-6.42 (m, 2H), 5.94 (d, IH, J=8.0 Hz), 5.50 (bs, 
IH), 4.50-^.60 (m, 11^, 4.10-4.18 (m, IH), 3.86-4.0 (m, 
2H), 3.25-3.30 (2s, 6H), 3.28-3.52 (m, 4H), 2.58-2.68 (m, 
IH), 2.20-2.32 (m, IH).. 

Compound #1090-91: Cbmpound #1090-89 (44 mg) was 
converted into 3 -H-phosphonate in normal fashion, 52 mg, 
in 77% yield. 
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R — OH 



PhaP, DEAD 
CH2a2, RT 
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1. H2, 10% 

Pdyc 



OH 

12 

a: R = ^CH2CH2NHCBZ, 
b:R= - — CH2CH2CH2NHCBZ, 
c:R= CHlCH2N(CH3)2 



2. FMOC — NHS, 
DMF,RT 

3, DMT — a 
Pyridine, RT 




DMTO' 



a:R'= CH2CH2NHFMOC, 

b:R'= CH2CH2CH2NHFMOC, 

c: R' = CH2CH2N(CH3)2 

• Steps 1 & 2 are ooly applied to 

R= CH2ai2NHCBZ 

and CHtCHjCH^NHCBZ 



N-CBZ-3-aminopropanol (Compound #1154-74, CBZ; 
benzyloxycarbonyl): A CH2CI2 solution (300 mL) of 
3-aminopn)panol (10 g, 0.133 mole) and TEA (20 g, 0.2 
mole) was cooled to 0^ C, followed by slow addition of 
CH2CI2 solution (25 mL) of benzyl chlorofonnate (25 g, 55 
0.147 mole). The resulting solution was gradually warmed 
to room temperature. The stirring was continued at room 
temperature overnight. The reaction mixture was washed 
with H2O, dried and purified by flash column chromatog- 
raphy (on silica gel, CH3OH— CH2CI2) to yield 16.4 g, 
58.9% of title compound. NMR (CDQa): 6 7.26-7.40 ^ 
(m, 5H), 5.10 & 5.0-5.10 (s+m, 3H), 3.68 (q, 2H), 3.36 (q, 
2H), 2.58 (t, IH), 1.70 (p, 2H). 

N-CBZ-2-aminoethanol (Compound #1154-104): To 0° 
C. cold of CH2CI2 solution (300 mL) of 2-aminoethanol 
(10.8 g, 0.177 mole) and TEA (26.8 g; 0.265 mole) was 65 
added slowly a CH2CI2 solution (50 mL) of benzyl chloro- 
fonnate (33.2 g, 0.194 mole). After complete addition, the 



resulting solution was gradually warmed to room tempera- 
ture and stirring was continued at room temperature over- 
night. The reaction was worked up and purilied by flash 
column chromatography on silica gel, affording 22.8 g in 
66% yield of title compound. NMR (CDCI3): 6 
7.34-7.40 (m, 5H), 5.20-5.32 (m, IH), 5.14 (s, 2H), 3.74 (q, 
2H), 3.37 (q, 2H), 2.38 (bs, IH). 

y-y -Diacetyl-N^-(2",6''-dihydroxyphenyl)-2'-deoxy-5- 
bromo-cytidine (Compound #1154-093): A CCI4— CH2CI2 
solution (150 mL— 150 mL) of 5-bromo-3'-5'-diacetyl-2'-* 
deoxyuridine (15 g, 38.3 mmole) and PhjP (15 g, 57.5 
mmole) was heated at reflux under N2 for 3 hrs. The reaction 
mixture was cooled to room temperature, followed by addi- 
tion of 2-amino-resorcinol (5.2 g, 42 mmole) and DBU (8.7 
g, 57.5 mmole). The resulting solution was stirred at room 
temperature overnight. The reaction mixture was concen- 
trated to about volume, then poured into citric acid 
aqueous solution (7.5 g in 300 mL HjO) with vigorously 
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stirring. The precipitate was filtered off, washed with HjO, Compound 13c: Compound 11c (130 mg, 228 mmole) 

CH2CI2 then CH3CN, dried in a vacuum oven overnight, was treated with saturated NH3 in CH3OH (30 mL). After 2 

weighed 12.9 g, 67,8% yield of title compound. NMR days at room temperature the reaction mixture was oonceo- 

(DMSO-dg): 6 9.63 (s, 2H), 8.21 (s, IH), 8.0 (s, IH), 6.89 trated to dryness to give crude 12c. The crude 12c was then 

(t, IH, J=8.1 Hz), 6.33 (d, 2H, J«8.1 Hz), 6.10 (t, IH, J»7.4 5 dissolved in pyridine (2 mL) containing TEA (89 mg, 1.2 

Hz), 5.10-5.17 (m, IH), 4.12-4.30 (m, 3H), 2.30-2.40 (m, mmole), followed by addition of DMT-Cl (115 mg, 340 

2H), 2.06 &2.03 (2s, 6H). mmole). After 2 hrs at room temperature, the reaction 

General procedure for synthesis of compounds lla-llc fixture was concentrated, partitioned between CH^Cl^ and 

and 12a-12c: Compounds lla-llc: To a CH^Cl, solution of ^^^^^^ ^^^^^ ^^^^^ ^^^^^^^^ ^^^^ purified on 

proper protected amino dcohd^ 10 ^^.^ ^^^^^^ ^.^^ CIl30II/CII,Cl2, then 10% 

1^"^!"^^ \ u ^"^'^"^ ! Tr CH30H/CH,C1, to yield compound 13c, 60 mg, 34% yield. 

#1154.093 (1 eq). rating mixture was stined at rooin ^ ^0-7 30 (m, lOH), 6.86 (dd, 4H), 

temperature overnight. The reaction mixture was washed . , , « ^ n x ^ /-r -.»t t o ^ tt \ ^ -i-^ A iti 

with H,0, dried and purified by a siHca gel (eluted with ,"^2' ^'^n ^"^^^'J^'l ^} ^fJ^"}^; 

CH30H-i:H,Cy to yield compound lla-llc, normally 15 ^^i?^>' 

they contaminated with Ph3P=0. However, the crude com- 4.0^.10 (m, 2H), 3.77 & 3.79 (2s, 6H), 3.32-3.45 

pound lla-llc was direcUy used for compounds 12a-12c i^^ 2H) 2.62-2.74 (m, 3H), 2.39 (s, 6H), 2.20-2.34 (m, IH). 

without further purification. Compound 13a (1154-175): A methanol solution (150 

Compound 12a-12c: Crude compound lla-llc was mL) of compound 12a (1154-111, 2.2 g, 4.3 mmole) was 

treated with saturated NH3 in CH3OH at room temperatxu-e 20 hydrogenated (H2 in a balloon) in the presence of 1 0% Pd/C 

for 1-3 days. After removal of solvent, the reaction mixture at room temperature overnight. The catalyst was filtered off, 

was purified by flash column chromatography on silica gel washed with CH3OH. The filtrate was concentrated to 

(eluted with CH3OH— CHjQa), to yield compounds dryness. The crude unprotected 12a derivative was dissolved 

12a-12c. in DMF/CH2CI2 (10 mL/5 mL), and reacted with FMOC- 

Compound 12a (1154-106. 1154-111): Compound 12a 25 nhS (1.73 g, 5.1 mmole) at room temperature for 1 hr The 

(2.5 g, 40.8%, 2 ste^) was prepared from 1154^93 (61) g, ^^^^^-^^ ^^^^^ CH^Cl^ (50 mL), con- 

ILT^^^l ll^t^^J^'^.f^u '^^^^^ pyridine (1.0 g, 12.7 mmole), foUowed by addition 

^ .'m To lilS'^a^^' of DMT-a (1.8 g, 5 3 mmole). After 2 hrs the reaction 

7.25-7.36 (m, 5H), 6.82 (t, IH, J«=8.2 Hz), 6.60-6.80 (m, . i\, a ^ u a u 1 

2H), 6.46 (d,lH,J«8.2 Hz), 6.14 (t,lH,J«6.7 Hz), 5.22 (d, 30 mixture was worked up and purified by flash column 

110, 5.06-5.13 (s+s, 3H), 4.20^.25 (m, IH), 3.94-^.0 (m, ^^Ho'Se^ ^' f ^ ^ ..TZ^lt IaI' 

IKf] 3.79-3.81 (m, IH), 3.58-3.65 (m, 2H); 3.40-3.50 (m, F^B HRMS calculated for C,3H,,N,0,o (M+H^ 901.345, 

2H), 1.98-2.20 (m, 2H). 901.344. 

Compound 12b: Compound 12b (0.30 g; 57.6%) was Compound 13b: A ethanol solution (15 mL) of compound 

prepared from 1154-93 (0.5 g; 1.0 mmole) and compound 35 12b (1154-077) was hydrogenated (H^ in a balloon) in the 

1154-074 (0.31 g; 1.5 mmole). ^H NMR (CD3OD): 8 7.65 presence of 10% Pd/C at room temperature for 4 hrs. 

(s, IH), 7.24-7.36 (m, 5H), 6.79 (t, IH, J=8.6 Hz), 6.54 (d. Catalyst was filtered off. The filtrate was concentrated to 

IH, J=8.2 Hz), 6.36 (d, IH, J«=8.2 Hz), 6.23 (t, irf), 5.06 (s, yield unprotected 12b derivative. Unprotected 12b deriva- 

2H), 4.38^.45 (m, IH), 3.92-4.11 (m, 3H), 3.81' (q, 2H), tive (125 mg, 0.32 mmole) was dissolved in DMF (2 mL) 

3.38 (t, 2H), 2.30-2.40 (m, IH), 2.10-2.22 (m, IH). 40 and reacted with FMOC-NHS (130 mg, 0.38 mmole) at 

Compound 11c: Compoimd 11c (86 mg; 37.7%) was room temperamre for 1 hr. The reaction mixture was diluted 

prepared from Compound #1154-093 (200 mg, 0.4 mmole) with CH^^^ (5 mL) containing pyridine (132 mg, 1.7 

and N,N-dimethylaminoethanol (43 mg, 0.48 mmole). ^H mmole), followed by addition of DMT-Q (135 mg, 0.38 

NMR (CDQa): 6 10.9 (s,lH), 8.49 (s, IH), 7.97 (s, IH), 7.11 mmole). After 2 hrs, the reaction mixture was diluted with 

(t, IH, J=8.2 Hz), 6.77 (d, IH, J«=8.2 Hz), 6.55 (d, IH, J=8.0 45 CH2CI2 (10 mL), washed with H^O, dried and purified by 

Hz), 6.32 (dd, IH), 5.23-5.27 (m, IH), 4.40-4.46 (m, 2H), flash column chromatography to give compound 13b, 86 

4.33-4.37 (m, IH), 4.20 (t, 2H, J=5.5 Hz), 2.70-2.80 (m+t, mg, 29% yield. FAB HRMS calculated for C54H51N4OJ0 

3H), 2.30 (s, 6H), 2.20 (s, 3H), 2.10-2.20 (m+s, 3H). (M+H"") 915.360, found 915.361. 
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R = CH2QCH2CH2N3 R' = CH2CX:H2CH2NHFM0C 



Compound #1154-108: 2-Amino-m-cresol (2.1 g, 17,6 
mmole) was dissolved in trimethylorthoformate (15 mL), 
followed by addition of methanesulfonic acid (0.32 g, 3.4 
mmole, 20 molar %). The resulting solution was stirred at 55 
room temperature for 3 hrs. The reaction mixture was 
neutralized with TEA (0.4 g, 4.1 mmole), concentrated and 
purified on silica gel, eluted with 2% CHgOH/CH^Clj, 
affording 1.6 g, 72% yield. NMR (CDCI3): 6 8.10 (s, 
IH), 7.43 (d, IH, J-8.2 Hz), 7.31 (dd, IH, J-7.3 Hz, J-8.2 60 
Hz), 7.20 (d, IH, J«7.3 Hz), 2.68 (s, 3H). 

Cbmpound #1154-112: Compound #1154-108 (4.3 g, 32 
mmole) was dissolved in CCI4 (50 mL), followed by addi- 
tion of N-bromosuccimide (6.3 g, 35 mmole) and AIBN 
(240 mg). The resulting mixture was refluxed for 2 hrs under 65 
Nj. The reaction mixture was washed with H2O, dried, 
concentrated and purified by flash column chromatography 



on silica gel, eluted with 2% CH3OH/CH2CI2, affording 5.67 
g. of product, 82.7%. 'H NMR (CDOg): 8 8.16 (s, IH), 7.55 
(d, IH), 7.40-7.50 (m, 2H), 4.90 (s, 2H). 

Compound #1154-125: A CH2CI2 solution (20 mL) of 
Compound #1154-112 (0.28 g, 1.3 mmole), ethylene glycol 
(0.82 g, 13.2 mmole), silver triflate (0,5 g, 1.95 mmole) and 
2,6-di-t-butyl-4-methylpyridine (0.52 g, 2.6 mmole) was 
stirred at room temperature for 3 hrs. The precipitate was 
filtered off, washed with CHzCli- Th^ filtrate was washed 
with H2O, dried, concentrated and purified on silica gel, 
eluted with 3% CH3OH/CH2CI2, affording 0.17 g, 68% of 
product #1154-125. 'H NMR (CDQa): 8 8,14 (s, IH), 7.55 
(d, IH), 7.36-7.40 (m, 2H), 4.97 (s, 2H), 3.70-3.85 (m, 4H), 

Compound #1154-126: Compound #1154-125 (170 mg, 
0.88 mmole) was dissolved in CH2CI2 (^ mL) containing 
TEA (0.27 g, 2.6 mmole) and treated with CH3SO2CI (0.15 
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g, 1.32 mmole). After 30 miD at room temperature, the 
reaction mixture was washed with H^O, dried and oonceo- 
tratcd. The residue was dissolved in DMF (2 mL) followed 
by addition of sodium azide (86 mg, 1.3 mmole). The 
resulting solution was heated at reflux for 2 hrs. The reaction 5 
mixture was then partitioned between CH2CI2 and water. 
The organic phase was isolated, dried and purified on silica 
gel, cluted with 1% CH3OH/CH2CI2, to afford #1154-126, 
162 mg, 84%. NMR (CDCI3): 6 8.10 (s, IH), 4,99 (s, 
2H), 3.76 (t. 2H, J«4.8 Hz), 3.45 (t, 2H, J=4.6 Hz). 10 

Compound #1154-132: Compound #1154-126 (1.1 g, 5.0 
mmole) was treated with 3N HQ aqueous solution (10 mL) 
and ethanol (10 mL) at 50* C. for 1 hr. The reaction mixture 
was concentrated to dryness, azeotroped with CH3CN three 
times, used for next reaction without further purification. "^H 15 
NMR (DMSO-dg): 6 10.6 (bs), 6.85-7.20 (m, 3H), 
4.60-4.70 (m, 2H), 3.60-3.72 (m, 2H), 3.50-3.60 (m, 2H). 

Compound #1154-133: 5-Bromo-3',5'-diacelyl-2*- 
deoxyuridine (0.98 g, 2.5 mmole) and ?hj? (0.8 g, 3.0 
mmole) were dissolved in CCU/CH^Cl^ (10 mL/10 mL), 20 
and were heated at reflux for 3 his. The reaction mixture was 
cooled to room temperature, followed by addition of #1154- 
132 (0.62 g, HCl sail, 2.5 mmole) and DBU (0.78 g, 5.1 
mmole). The resulting mixture was then stirred at room 
temperature for 2 days, washed with 5%, citric acid aqueous 25 
solution, dried, concentrated and purified on silica gel. The 
isolated product contained Ph3P==0, without further puri- 
fication and used for next reaction. 



Compound #1154-138: The crude #1154-133 
(Theoretically 2.5 mmole) was treated with saturated NH3 in 
CH3OH at room temperature for 3 days, concentrated to 
dryness, and purified in silica gel, yield 0.28 g of 28% yield 
of product (2 steps). ^H NMR (CD3OD): b 7.56 (bs, IH), 
6.7^.85 (m, 2H), 6.64-^.70 (m, IH), 6.19 (t, IH, J=6.4 
Hz), 4.54 (s, 2H), 4.35^.40 (m, IH), 3.88-3.94 (m, IH), 
3.60-3.85 (m, 4H), 3.62 (t, 2H), 2.22-2.35 (m, IH), 
2.10-2.20 (m IH). 

Compound #1154-148: A CH3OH solution (25 mL) of 
Compound #1154-138 (160 mg, 0.38 mmole) was hydroge- 
nated (H2 in a balloon) in the presence of 10% Pd/C at room 
temperature for 2 hrs. The catalyst was filtered off, washed 
■with CH3OH. The filtrate was concentrated to dryness to 
afford #1154-148. 'H NMR (CD3OD): 8 7.63 (s, IH), 
6.81-6.86 (m, 2H), 6.70-6.74 (m, IH), 6.22 (t, IH), 4.65 (s, 
2H), 4.35^.40 (m, IH), 3.89-3.93 (m, IH), 3.68-3.84 (m, 
2H), 3.55 (t, 2H), 2.80-2.90 (m, 2H), 2.25-2.36 (m, IH), 
2.10-2.21 (m, IH). 

Compound #1154-149: A DMF solution (2 mL) of com- 
pound #1154^148 (170 mg, 0.43 mmole) was treated with 
FMOC-NHS (150 mg, 0.52 mmole). After 2 hr reaction at 
room temperature the reaction mixture was diluted with 
CH2CI2 (10 ™L) containing pyridine (0.7 g, 8.7 mmole), 
followed by addition of DMT-Cl (0.22 g, 0.65 mmole). After 
1 hr, the reaction was worked up and purified by flash 
column chromatography, to yield 232 mg, 58.2% of 5'-0- 
DMT-N-FMOC derivative #1154-149. FAB HRMS calcu- 
lated for C54H51N4O10 (M+H*) 915.360, found 915.359. 




#1154-135 
R= OCHaQIaNHa 
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#1154-137 





OH 

#1154-144 



3'-5'-Diacetyl-0^-sulfonyl-2'-deoxyuridine (#1154-136): 
ACH2CI2 solution (10 mL) of 3'^'-diacetyl-2'-deoxyuridine 
(0,76 g, 2.4 mmole), 2-mesitylenesulfonyl chloride (1.0 g, 
4.8 mmole), TEA (1.23 g, 12.1 mmole) and catalytic amount 
of DMAP (0.1 g) was stirred at room temperature overnight. 
The reaction mixture was diluted with CH2CI2, washed with 
5% citric acid aqueous solution, dried and pitfified on silica 
gel, eluted with CH2a2, 35% ethyl acetate in CHjClj, to 
give the title compound, 0.76 g in 63% yield. NMR 
(CDQa): 8 8.0 (d, IH, J»9 Hz), 6.99 (s, 10 2H), 6.13 (d, IH, 
J=9 Hz), 6.07 (t, IH, J=6.0 Hz), 5.15-5.20 (m, IH), 4.33 (s, 
3H), 2.70-2.85 (m+s, 7H), 2.73 (s, 3H), 2.09 (s, 3H), 
1.98-2.07 (m+s, 4H). 

3'-5*-0-(l,l,3,3-Tetraisopropyl-l,3-disiloxyanediyl)-9"- 
(aminoethoxy)phenoxazine (#1154-135): Compound 14 
(see preparation of 12a and 13a) was dissolved in DMF/ 
pyridine (2 mL/2 mL) followed by addition of 1,13^- 
tetraisopropyl-dichlorodisilane (0.28 g, 0.91 mmole). The 
stirring was continued at room temperature for 3 hr. The 
reaction mixture was concentrated to dryness. The residue 
was used for next reaction without further purification. 

Compound #1154-137: A CHjCl^ solution (10 mL) of 
compound #1154-136 (03 g, 0.61 mmole), compound 
#1154-135 (crude, 0.61 mmole), and DBU (0.46 g, 3.0 
mmole) was stirred at room temperature overnight. The 
reaction mixture was washed with 5% citric acid aqueous 
solution, dried, and purified by flash column chromatogra- 
phy (silica gel, CHsOH/CHjaj), to give 0.21 g, 38% of 
product #1154-137. FAB LRMS calculated . for 
C42H6iN60i3Si2 (M+H-^) 912, found 913. 

Compound #1154-144: Compound #1154-137 (0.20 g, 
0.22 mmole) in THF 1 mL, was treated with 1 M BU4NF in 
CH2CI2 solution (0.87 mL, 0.87 mmole) at room tempera- 
ture for 30 min, then concentrated to dryness. The residue 
was redissolved in CH2CI2 (5 mL) containing pyridine (173 
mg; 2.2 mmole), followed by addition of DMT-Cl (330 mg, 
1 mmole). After 2 hrs, the reaction mixture was washed with 
saturated NaHCOj aqueous solution, dried, and purified by 
flash column chromatography (siUca. gel, CH3OH/CH2CI2) 
to yield 0,14 g, 65% of product #1154-144. FAB HRMS 
calculated for Cs^Uss^eOiA (M+H*) 973.362, found 
973.364. 

EXAMPLE 2 

Antisense inhibition of T-antigen expression. The tested 
DMA oligonucleotide analogs, had the base sequence shown 



in Table I below. This sequence is complementary to a the 
12 base RNA sequence expressed by the large SV40 virus 
20 large T antigen gene, 5' GTA GTG AGO AGG 3' (SEQ ID 
NO. 1). In the tested oligonucleotides, which are shown in 
Table I, each linkage was a phosphorothioate linkage and all 
sugars were 2'-deoxyribose. 

25 

The tested oligonucleotides had the base sequences shown 
in Table I. In the Table I oligonucleotides, bases were 
30 designated as follows. 



35 



45 



Base 




abbreviation 


base 


C 


5-meth ylcytosine 


U 


5- (1-propyny l)uiacil 


A 


adenine 


T 


thymine 


D 


5-(l-propynyl)c>'tosine 


Z 


structure (58) 


V 


structure (61) 


X 


structure (57) 


Y 


structure (60) 



50 

Structures of tricyclic bases described in this example and 
in following examples are as follows. 

55 (55) 



60 



65 





74 

-continued 

(60) 




J. , 

Phenoxazine THcyclic 
Cytidine 




The T„ values obtained from the RNA hybrids and the 
intracellular IC50 values for T-antigen inhibition derived 
from microinjection analysis are shown below. In all cases, 
P-galactosidase inhibition was concurrently tested as an 
internal cootrol and we saw no inhibition at 5 //M, the 
highest concentration tested. We measured inhibition of T 
antigen expression in CV-1 cells in tissue culture essentially 
as described (Wagner et al., "Science" 260:1510-1513 
1993). We measured the AT„ (° C.) values relative to control 
ODNl essentially as described (Jones et al. "JOC" 58:2983 
1993). 
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TABLE I 



ODN 










IC50 


# 


BASE 


ODN sequence 5' to 3' 




(^ 


1 


C control 


CCU-CXJU-CAC-UAC 




(65.0) 


n c 1 n 
U.J— l.U 


2 


D cootrol 


DDU-DDU-DAD-UAD 


SEQ ID NO. 3 


+13.0 


0.25 


3 


(57) 


XCU<XaJ-CAC-UAC 


SEQ ID NO. 4 


+2.5 


1.0 


4 


(57) 


CXU-CCU-CAC-UAC 


SEQ ID NO. 5 


+9.5 


0.05 


5 


(57) 


CCU-XCU-CAC-UAC 


SEQ ID NO. 6 


+6.5 


0.1 


6 


(57) 


CCU-CXU-CAC-UAC 


SEQ ID NO. 7 


+9.0 


0.05 


7 


(57) 


CCU-CCU-XAC-UAC 


SEQ ID NO. 8 


+6.5 


0,05-0.1 


8 


(57) 


(XU-CX:U-CAX-UAC 


SEQ ID NO. 9 


+5.0 


0.1 


9 


(57) 


CXrUCCU-CAC-UAX 


SEQ ID NO. 10 


+5.5 


2.0 


10. 


mismatch 


CCX-CCU-UAC-UAC 


SEQ ID NO. 11 


-115 


>0.5 


11 


T control 


CXT-CCTCAOTAC 


SEQ ID NO. 12 


-11.0 


>1.0 


12 


T-(57) 


CCT-CCr-XAC-TAC 


SEQ ID NO. 13 


-2.0 


0.25 


16 


D 


cx:u-cx:u-DAc-uAC 


SEQ ID NO. 14 


0 


0.5-1 


17 


(60) 


OCU-CCU-YAOUAC 


SEQ ID NO. 15 


+0.5 


>0.5 


18 


(58) 


CCU-CCU-ZAC-UAC 


SEQ ID NO. 16 


+5.0 


0.3 


19 


(61) 


CXrU-CCU-VAC-UAC 


SEQ ID NO. 17 


+1.0 


0.35 



*ATiQ Relative to control ODNl 



The results above show that the presence of an invention 
base elicits potent and specific antisense inhibition of target 
gene expression (compare ODNl vs. 0DN4 and 0DN6). 
Incorporation of one invention base at an internal C position 
resulted in a potency enhancement which exceeds that 
obtained from the substitution of seven 5-(l-propynyl) 
cytosine bases for 5-methylcytosine (compare 0DN2 and 
ODN4). The ODN12 results showed that oligonucleotides 
containing limited base substitutions and only a single 
invention base may have significant antisense potency. 
Phosphorothioate linked oligodeoxynucleotides containing 
propynyl-substituted pyrimidine bases were previously the 
most potent class of antisense agents that workers had 
described. 

Oligonucleotides containing a base of structure (3) at an 
internal position in the oligonucleotide where was (a) the 
(S) isomer of-O— CH2— C*H(CH3)— NH2 or (b) the (R) 
isomer of — O--CH2— C*H(CH3)— NH2 were prepared 
and tested for binding afl&nity in a similar manner. Both 
oligonucleotides had an increased binding affinity compared 
to the control oligonucleotide. 

EXAMPLES 

Antisense inhibition of p27*'^^ expression. The sequence 
of ODN13 was 5' UGGCUCUCCUGCGCC 3' (SEQ ID NO. 
18) and it contained phosphorothioate linkages, all sugars 
were 2'-deoxyriT)ose, all G positions contained guanine, all 
U positions contained 5-(l-propynyl)uracil and all C posi- 
tions contained 5 -(1-propynyl) -cytosine. 0DN14 had the 
same sequence and base composition as ODNl 3, except that 
all C positions contained 5-methylcytosine instead of 5-(l- 
propynyl)cytosine and thymidine instead of 5 -(1-propynyl) 
uracil. 0DN15 had the same sequence and base composition 
as 0DN14, except that the 9th C position from the 5' end 
contained a (57) base instead of 5-methylcytosine. Each 
ODN was tested for its capacity to inhibit p27 gene expres- 
sion in CV-1 cells in cell culture using cationic lipid to 
deliver the oligonucleotides into the cells, essentially as 
described (Coats "Science" 272:877-^0, 1996). 
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ODN 


IC5o(nM) 




13 


10 




14 


>20 




15 


<5 



The results showed that ODN15 containing one (57) base is 
more potent than the all-propyne derivative 0DN13. 
Previously, phosphorotbioate-linked oligonucleotides con- 
taining 5-l(piopynyl)-modified pyrimidine bases were the 
among the most potent reported class of antisense agents. 
35 0DN13 was tested in rats and, in a 10 day toxicological 
evaluation, it was found that the MTD (maximum tolerated 
dose) was 0.6 mg/kg/d i.v. 0DN15 was tested in the same 
manner with no toxicity observed at a dose of 6.0 mg/kg/d 
i.v. 

40 EXAMPLE 4 

Increased oligonucleotide binding afElnity and specificity. 
We made a 10-mer oligonucleotide, ODN22, having the 
base sequence, 5' TCTCCCTCTC 3' (SEQ ID NO. 19). 
0DN22 contained only phosphorothioate linkages, aU sug- 

45 ars were 2'-deoxyribose, bases designated T were thymine, 
bases designated C were 5-methylcytosine. 0DN23 was the 
same as ODN22, except that the 5th base position from the 
5' end contained 5-(l-propynyl)cytosine (base designated 
"D" in Table IV). ODN24 was the same as ODN22, except 
that the 5th base position from the 5* end contained a 

50 structure (60) base, ODN25 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (55) base. ODN26 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (57) base. ODN27 was the same as ODN22, except 

55 tiiat the 5th base position from the 5' end contained a 
structure (59) base. ODN28 was the same as ODN22, except 
that the 5th base position from the 5' end contained a 
structure (58) base. ODN29 was the same asODN22, except 
that the 5th base position from the 5' end. contained a 
structure (61) base. ODN30 was the same as ODN22, except 

^0 ihat the 5th base position from the 5* end contained a 
structure (65) base. 

We measured the T^ C C.) of ODN22-ODN30 using 4 
different oligonucleotides: A complementary RNA oligo- 
nucleotide (0DN31), an RNA oligonucleotide (ODN32) 

65 having adenine at the 11th position from its 5' end, i.e., a 
single A: test base mismatch, a complementary DNA oligo- 
nucleotide (ODN33), and a DNA oHgonucleotide (ODN34) 



